Bug #33793 | Race condition between "release gci" and node-failure handling | ||
---|---|---|---|
Submitted: | 10 Jan 2008 11:52 | Modified: | 20 Feb 2008 21:48 |
Reporter: | Lars Torstensson | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S3 (Non-critical) |
Version: | * | OS: | Any |
Assigned to: | Jonas Oreland | CPU Architecture: | Any |
Tags: | MicroGCP |
[10 Jan 2008 11:52]
Lars Torstensson
[10 Jan 2008 13:09]
Bogdan Kecman
As explained here: <http://dev.mysql.com/doc/refman/5.1/en/mysql-cluster-config-params-ndbd.html> > NoOfFragmentLogFiles=64 -> NoOfFragmentLogFiles=362 > FragmentLogFileSize=16 -> FragmentLogFileSize=32M These changes require "Initial System Restart"
[11 Jan 2008 6:36]
Jonas Oreland
If release gci has updated max-acked-gci, but not yet released last page and there is a node failure, then the node failure code contains(ed) a incorrect assertion that last page should be empty. --- Consequence is cluster-failure. --- Solution is to correct assertion. --- Note on changed subject: 1) This bug has nothing to do with changing of config parameters 2) This bug has nothing to do with upgrade I.e This is just a plain-old bug, however very unlikely as it's the first time we see it, and it's present in all mysql-version with replication
[11 Jan 2008 7:32]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/40893 ChangeSet@1.2185, 2008-01-11 08:33:09+01:00, jonas@perch.ndb.mysql.com +4 -0 bug#33793 - dont assume that page is "all empty" only as gci is acked, as release_gci might not have processed it yet
[11 Jan 2008 8:19]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/40895 ChangeSet@1.2529, 2008-01-11 09:20:16+01:00, jonas@perch.ndb.mysql.com +4 -0 ndb - bug#33793 dont assume that page is "all empty" only as gci is acked, as release_gci might not have processed it yet
[11 Jan 2008 9:33]
Jonas Oreland
pushed to drop6, 51-ndb, 51-telco-gca, telco-61, telco-62, telco-63, telco-64 and 51-telco
[1 Feb 2008 14:18]
Jon Stephens
Documented in 5.1.23-ndb-6.3.8 changelog as follows: A race condition could occur (very rarely) when the release of a GCI was followed by a data node failure. Left bug in PQ status pending additional merges.
[2 Feb 2008 12:05]
Jon Stephens
Also documented for 5.2.23-ndb-6.2.11; left status unchanged.
[20 Feb 2008 16:03]
Bugs System
Pushed into 5.1.24-rc
[20 Feb 2008 16:03]
Bugs System
Pushed into 6.0.5-alpha
[20 Feb 2008 21:48]
Jon Stephens
Also documented for 5.1.24 and 6.0.5.