Bug #14408 | ndbd (MySQL Cluster) hangs in Phase 5 during recovery under load | ||
---|---|---|---|
Submitted: | 27 Oct 2005 20:50 | Modified: | 15 May 2008 13:54 |
Reporter: | Tim Heath | | |
Status: | No Feedback | | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S2 (Serious) |
Version: | 4.1.15 | OS: | FreeBSD (FreeBSD 5.4) |
Assigned to: | Assigned Account | CPU Architecture: | Any |
[27 Oct 2005 20:50]
Tim Heath
[14 Dec 2005 6:39]
Jonas Oreland
Hi,

This could be Bug #15685. The attached patch is quite big as it contains a test as well, but the bug fix itself is the part below. It has been pushed into 4.1.17 (I don't know when that will come out). Can you try the patch and see if it helps?

/Jonas

--- 1.60/ndb/src/kernel/blocks/dbtc/DbtcMain.cpp 2005-10-21 06:41:46 +02:00
+++ 1.61/ndb/src/kernel/blocks/dbtc/DbtcMain.cpp 2005-12-12 17:19:02 +01:00
@@ -6129,7 +6129,6 @@
 << " - place: " << c_apiConTimer_line[apiConnectptr.i]);
 switch (apiConnectptr.p->apiConnectstate) {
 case CS_STARTED:
- ndbrequire(c_apiConTimer_line[apiConnectptr.i] != 3615);
 if(apiConnectptr.p->lqhkeyreqrec == apiConnectptr.p->lqhkeyconfrec){
 jam();
 /*
@@ -6389,8 +6388,8 @@
 warningEvent(buf);
 ndbout_c(buf);
 ndbrequire(false);
+ releaseAbortResources(signal);
 }
- releaseAbortResources(signal);
 return;
 }//if
 TloopCount++;
[17 Mar 2006 10:19]
Jonas Oreland
Another question, did you ever get error code 4012?
[8 Apr 2006 11:40]
Jonas Oreland
Since we have been able to reproduce this, could you try a newer 4.1 version? There have been a few bug fixes that could match your problem.
[17 Apr 2006 23:00]
Bugs System
No feedback was provided for this bug for over a month, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".
[4 Jan 2007 19:32]
Matthew Boehm
Hello, I am reopening this bug because I am running 5.0.27 and am experiencing the same issue. I am running the cluster in the same config (1 mgm, 2 nodes also running api). Node 2 crashed a few days ago (unable to contact other nodes), and when I tried to bring it back, it hung in phase 5. I killed the processes on that machine, removed all ndb_X_* files and directories from /var/lib/mysql/, and started back up with --initial, and it still hangs in phase 5. I too am seeing TIME_WAIT in netstat from itself, over 1300 of them. What info do you need from me? Right now Node 1 is on, so the cluster is working in production.
[15 Apr 2008 13:54]
Susanne Ebrecht
All reporters, we have fixed lots of bugs. Please test with a newer version of MySQL (also on FreeBSD) and let us know if you still get this problem.
[15 May 2008 23:00]
Bugs System
No feedback was provided for this bug for over a month, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".