Bug #13461 | Slave Cluster crashed on restart of two data nodes in seperate groups | ||
---|---|---|---|
Submitted: | 24 Sep 2005 15:22 | Modified: | 14 Oct 2005 8:28 |
Reporter: | Jonathan Miller | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S2 (Serious) |
Version: | 5.1 4.1? 5.0? | OS: | Linux (Linux) |
Assigned to: | Tomas Ulin | CPU Architecture: | Any |
[24 Sep 2005 15:22]
Jonathan Miller
[27 Sep 2005 9:33]
Jonas Oreland
1) it not supported to "take down" a node during NR This currently _should_ crash starting node. 2) Regarding the "unable to find", is this reproducable? If so how?
[27 Sep 2005 10:45]
Jonathan Miller
1) it not supported to "take down" a node during NR This currently _should_ crash starting node. > That is a bug. If it is not supported then block me from do so, que it up and run is after, but crash starting node and cluster is not exceptable. 2) Regarding the "unable to find", is this reproducable? If so how? > Not 100% sure if it reproducable, but all the files you need from it are on ndb10, ndb11, and ndb12.
[27 Sep 2005 10:51]
Jonas Oreland
>That is a bug. If it is not supported then block me from do so, que > it up and run is after, but crash starting node and cluster is not > exceptable. Do you mean from ndb_mgm, then its a bug there. Please report it separatly if you can repeat it. Otherwise it hard to block. I can _never_ block you from kill -9 or physically unplugging a cable. BTW: The cluster shouldnt fail. Is this reproducable? 2) Regarding the "unable to find", is this reproducable? If so how? >> Not 100% sure if it reproducable, but all the files you need from it >> are on ndb10, ndb11, and ndb12. Can you please try to reproduce the test case?
[27 Sep 2005 11:03]
Jonathan Miller
> Yes, the ndb_mgmd allowed me to issue the restart. Not sure why I need to open a different bug report as this is the bug report that I have open for it. > Yes this is reproducable, get yourself a large database, restart a data node, when it enters phase 4, restart another data node in the other group.
[27 Sep 2005 11:17]
Jonas Oreland
1) bug if ndb_mgm allowed you stop a node while another was restarting 2) bug if cluster fails during this (unless in different node groups) 3) bug if it then fails to perform SR. But I interpret your replies as: only 1) is relevant (and reproducable?)
[13 Oct 2005 12:42]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/internals/31021
[13 Oct 2005 15:23]
Tomas Ulin
pushed into 5.0.15 only
[14 Oct 2005 8:28]
Jon Stephens
Thank you for your bug report. This issue has been committed to our source repository of that product and will be incorporated into the next release. If necessary, you can access the source repository and build the latest available version, including the bugfix, yourself. More information about accessing the source trees is available at http://www.mysql.com/doc/en/Installing_source_tree.html Additional info: Documented fix in 5.0.15 changelog.