Bug #58904 Ndb : FAIL_REP signal does not include source node id
Submitted: 13 Dec 2010 15:16 Modified: 6 Jan 2011 6:15
Reporter: Frazer Clement Email Updates:
Status: Closed Impact on me:
None 
Category:MySQL Cluster: Cluster (NDB) storage engine Severity:S4 (Feature request)
Version:mysql-5.1-telco-6.3 OS:Any
Assigned to: Frazer Clement CPU Architecture:Any

[13 Dec 2010 15:16] Frazer Clement
Description:
The FAIL_REP signal used inside the Ndbd kernel to declare a node failed, does not include the nodeid of the node detecting the failure.

This information is useful for debugging purposes, especially given the 'epidemic' nature of this signal's propagation through the cluster.

It should be added to the signal.

How to repeat:
Activate signal tracing.

Cause a node to fail (e.g. kill -6)

Observe signal traces in other nodes

Suggested fix:
Add a source node id
[13 Dec 2010 15:25] Bugs System
A patch for this bug has been committed. After review, it may
be pushed to the relevant source trees for release in the next
version. You can access the patch from:

  http://lists.mysql.com/commits/126662

3363 Frazer Clement	2010-12-13
      Bug#58904 Ndb : FAIL_REP signal does not include source node id
      
      Source node id should be included to aid debugging and enable more 
      intelligent failure report handling.
[13 Dec 2010 15:50] Bugs System
Pushed into mysql-5.1-telco-7.0 5.1.51-ndb-7.0.21 (revid:frazer@mysql.com-20101213153450-5czlh089vpqfo7v1) (version source revid:frazer@mysql.com-20101213153450-5czlh089vpqfo7v1) (merge vers: 5.1.51-ndb-7.0.21) (pib:23)
[13 Dec 2010 15:55] Bugs System
Pushed into mysql-5.1-telco-6.3 5.1.51-ndb-6.3.40 (revid:frazer@mysql.com-20101213152410-ozb61hffqx17fkex) (version source revid:frazer@mysql.com-20101213152410-ozb61hffqx17fkex) (merge vers: 5.1.51-ndb-6.3.40) (pib:23)
[20 Dec 2010 9:43] Jonas Oreland
pushed to 6.3.40, 7.0.21 and 7.1.10
[6 Jan 2011 6:15] Jon Stephens
Documented in the NDB-6.3.40, 7.0.21, and 7.1.10 changelogs as follows:

        The FAIL_REP signal, used inside the NDB kernel to declare that
        a node has failed, now includes the node ID of the node that
        detected the failure. This information can be useful in
        debugging.

Closed.