Bug #63630 All cluster nodes crashes whithin error 2341 in object DBLQH Line 10201 and 1072
Submitted: 6 Dec 2011 16:15 Modified: 20 Oct 2016 4:04
Reporter: Max Karpenkov Email Updates:
Status: Can't repeat Impact on me:
None 
Category:MySQL Cluster: Cluster (NDB) storage engine Severity:S1 (Critical)
Version:mysql-5.1.56 ndb-7.1.15a OS:Solaris (sparc 64bit)
Assigned to: CPU Architecture:Any
Tags: cluster, crash, error 2341

[6 Dec 2011 16:15] Max Karpenkov
Description:
I have 2 cluster nodes whithin 2 arbitration nodes on it and periodicaly all of this 2 nodes crashes whith folowing(2 crash examples):

Time: Friday 14 October 2011 - 14:31:05
Status: Temporary error, restart node
Message: Internal program error (failed ndbrequire) (Internal error, programming error or missing error message, please report a bug)
Error: 2341
Error data: dblqh/DblqhMain.cpp
Error object: DBLQH (Line: 10201) 0x00000002
Program: ndbd
Pid: 16033
Version: mysql-5.1.51 ndb-7.1.9
Trace: /var/mysql_cluster/mysqld_data/ndb_11_trace.log.4
***EOM***

Time: Thursday 1 December 2011 - 01:00:37
Status: Temporary error, restart node
Message: Internal program error (failed ndbrequire) (Internal error, programming error or missing error message, please report a bug)
Error: 2341
Error data: dblqh/DblqhMain.cpp
Error object: DBLQH (Line: 10720) 0x00000002
Program: ndbd
Pid: 21790
Version: mysql-5.1.56 ndb-7.1.15a
Trace: /var/mysql_cluster/mysqld_data/ndb_11_trace.log.5
***EOM***

How to repeat:
i dont know yet how to repeat it
[6 Dec 2011 16:17] Max Karpenkov
Two trace files are attached here...
[6 Dec 2011 16:25] Max Karpenkov
first crash file

Attachment: ndb_11_trace.log.4.gz (application/x-gzip, text), 33.16 KiB.

[6 Dec 2011 16:25] Max Karpenkov
second crash trace file

Attachment: ndb_11_trace.log.5.gz (application/x-gzip, text), 59.87 KiB.

[6 Dec 2011 16:32] Jonas Oreland
Possible fix

Attachment: fix_fragmented_70.patch (text/x-patch), 21.46 KiB.

[6 Dec 2011 16:33] Jonas Oreland
Hi,

I suspect that this is a bug that has been fixed in >= 7.1.17
and if that is the case, then it's the patch that I attached that should
do the trick.

If you can test it, it would be great!

/Jonas
[6 Dec 2011 16:34] Max Karpenkov
Can affect to this slow queries, or too many same queries???
[6 Dec 2011 16:37] Jonas Oreland
neither, if my guess is correct,
it's a particular query...

/Jonas
[6 Dec 2011 16:40] Max Karpenkov
yes, two different errors in different lines, so in 7.1.17 there is no problem?
or your patch only required???
[6 Dec 2011 16:44] Jonas Oreland
I don't understand.

I think the bug is fixed in 7.1.17, but you can never be sure without testing.
I want you to test the patch for your occurrence, and that you can do by applying it to 7.1.15a.

If testing is impossible, I'll say that I'm sure enough...
that I'll close this as "already fixed"

/Jonas
[6 Dec 2011 16:48] Max Karpenkov
ok, i'l do the test, but this situation has occured once or twice in month, or never per month...
thanks a lot Jonas!!!