Bug #51056 | Node can not restart after crash error 2341 | ||
---|---|---|---|
Submitted: | 10 Feb 2010 11:21 | Modified: | 16 Apr 2010 6:06 |
Reporter: | Christian Loebbert | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S1 (Critical) |
Version: | mysql-5.1-telco-7.0 | OS: | Linux (SuSE Linux 11.1 2.6.27.39-0.2-xen) |
Assigned to: | Pekka Nousiainen | CPU Architecture: | Any |
Tags: | 7.0.7, crash, error 2341, node, restart |
[10 Feb 2010 11:21]
Christian Loebbert
[15 Feb 2010 20:59]
Pekka Nousiainen
Christian, So, node 3 is up and trying to start node 4 with --initial causes node 4 to crash. The apparent cause is that node 4 gets a duplicate tablename from node 3. In the Feb 8 log this is UserTable with id 3888. This indicates node 3 dictionary is corrupt. Look at ndb_show_tables output. If possible, upload the output and directories D1,D2 from node 3, e.g: tar cf node3fs.tar ndb_3_fs/D[12]
[23 Mar 2010 0:00]
Bugs System
No feedback was provided for this bug for over a month, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".
[24 Mar 2010 8:34]
Gustaf Thorslund
Christian, The output of: ndb_show_tables described at: http://dev.mysql.com/doc/refman/5.1/en/mysql-cluster-programs-ndb-show-tables.html could be useful. /Gustaf
[15 Apr 2010 8:04]
Christian Loebbert
after serveral weeks I have done a mysqldump of all databases and restarted both cluster nodes with --initial. Both nodes started without problems, but of course all my databases are lost. I have imported them from mysqldump and everything is fine. Therefore I guess output from ndb_show_tables is not usefull for your. Nevertheless I have attached that output and hope you can find something what give me the possibility to bring the cluster back to work without using a backup and a long downtime.
[16 Apr 2010 6:04]
Pekka Nousiainen
I guess you didn't see my response from Feb 15 which was mistakenly hidden from public. The problem obviously was dictionary corruption and finding out the reason is unlikely. Good you got the db up. I'll close this bug.