Bug #33951 | SQL/API nodes crash or hang repeatedly when my.cnf contains same server-id value | ||
---|---|---|---|
Submitted: | 21 Jan 2008 0:38 | Modified: | 31 Oct 2008 19:10 |
Reporter: | Jason Brooke | Email Updates: | |
Status: | Can't repeat | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S1 (Critical) |
Version: | 5.0.45, 5.1.22rc | OS: | Linux (RHEL 5.1) |
Assigned to: | Don Kehn | CPU Architecture: | Any |
Tags: | same server-id crash hang |
[21 Jan 2008 0:38]
Jason Brooke
[6 Feb 2008 14:03]
Sveta Smirnova
Thank you for the report. If I understood you correct you say same server id leads one of API nodes to hang? Please confirm or reject.
[6 Feb 2008 14:21]
Jason Brooke
That's correct Sveta. There seems to be two symptoms: - both API nodes randomly crashing - both API nodes get hung queries in processlist, which prevents most subsequent queries from completing, even against other tables I haven't been able to find the exact condition that triggers these symptoms, I only that once I correct the misconfiguration, it all runs beautifully.
[6 Feb 2008 15:04]
Sveta Smirnova
Thank you for the feedback. Status was changed to "Verified" by mistake. I could not repeat hang with test data. Could you please describe which type of queries "hang" in your case? How much time servers run smoothly before hang?
[7 Feb 2008 23:09]
Jason Brooke
The amount of time that would pass before a crash or hung queries seemed inconsistent - sometimes they'd go for a few days, other times just a few minutes. Meanwhile, the queries that were being run were always quite consistent, and it's a single application using the database. I'm attaching some files with some info about queries etc.
[31 Oct 2008 19:10]
Don Kehn
Can't repeat with at this point with 5.1.29rc, will require a stack trace of the crash in order to determine exactly where the problem occurs. Note: After testing with 6.1, 6.2, 6.3, & 5.1.29rc have not seen this issue.