Bug #68261 | ndbmtd, mysqld process fails | ||
---|---|---|---|
Submitted: | 4 Feb 2013 10:44 | Modified: | 18 May 2013 14:50 |
Reporter: | Mateusz Kamola | Email Updates: | |
Status: | No Feedback | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S1 (Critical) |
Version: | mysql-5.5.29 ndb-7.2.10 | OS: | Linux (Debian GNU/Linux wheezy) |
Assigned to: | | CPU Architecture: | Any |
[4 Feb 2013 10:44]
Mateusz Kamola
[4 Feb 2013 10:56]
Mateusz Kamola
I have uploaded the data generated by ndb_error_reporter. It's 9 MB, so I uploaded it via FTP.
[11 Feb 2013 15:28]
chris kyte
Experiencing exactly the same issue. Nightmare!
[11 Feb 2013 15:36]
Mateusz Kamola
@chris kyte: we have managed to significantly decrease the probability of a crash (it has been running for about 5 days since the last crash; earlier it happened every night) by loading data in much smaller chunks. We now insert only about 30k rows per LOAD DATA INFILE and it's much more stable.
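For anyone wanting to try the same workaround, here is a minimal sketch of splitting an input file into chunks of about 30k rows and generating one LOAD DATA INFILE statement per chunk. It is not the reporter's actual script: the function names, the chunk file naming and the table name are hypothetical, and it assumes one row per line with no embedded newlines inside fields.

```python
import itertools
from pathlib import Path


def split_into_chunks(src, out_dir, rows_per_chunk=30_000):
    """Split a line-oriented data file into files of at most rows_per_chunk lines.

    Assumption: every row occupies exactly one line of the input file.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunk_paths = []
    # newline="" keeps the original line terminators untouched.
    with open(src, "r", newline="") as f:
        for index in itertools.count():
            rows = list(itertools.islice(f, rows_per_chunk))
            if not rows:
                break
            path = out_dir / f"chunk_{index:05d}.txt"  # hypothetical naming scheme
            with open(path, "w", newline="") as out:
                out.writelines(rows)
            chunk_paths.append(path)
    return chunk_paths


def load_statements(chunk_paths, table):
    """Build one LOAD DATA INFILE statement per chunk file (table name is a placeholder)."""
    return [
        f"LOAD DATA INFILE '{p.as_posix()}' INTO TABLE {table};"
        for p in chunk_paths
    ]
```

The generated statements can then be run one at a time, so each load is a small unit of work. The chunk files must be readable by the MySQL server process, and any field or line terminator clauses your data needs would have to be added to the statements.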
[11 Apr 2013 14:07]
Shahryar Ghazi
Hi Mateusz,
- Were you able to reproduce the issue?
- Are you still seeing SQL nodes crash under high load? Can you upload the NDB error report, cluster logs and configuration file, and the MySQL error log and configuration files again, please?
- Do SQL nodes crash only under high load, and otherwise run fine (100% of the time)?
Please let me know. Thanks.
[11 Apr 2013 14:40]
Mateusz Kamola
Hi Syed,
We decided to switch to Diskless mode (we were concerned about the heavy writes the cluster was making to our SSDs), after which everything worked much better and was more stable (but not perfect). We still experienced crashes, but we managed to isolate one of the causes.
Some of the data from the NDB tables was copied to InnoDB tables, where we were running some analytics. One of the queries used GROUP BY on a few very large tables. It created a big internal temporary table which didn't fit into RAM and was moved to disk. The results were then sorted, which (because the data was on disk) caused 100% disk usage for a couple of minutes. A few minutes of 100% disk usage and the cluster was down. Every single time, even though we were using Diskless mode. This surely was one of the reasons we experienced crashes earlier. However, it caused all NDB nodes to crash, not SQL nodes.
After we changed the queries so that MySQL no longer sorts huge tables on disk, and with Diskless mode enabled, MySQL Cluster stopped crashing. Still, another problem came up, which I described here: http://forums.mysql.com/read.php?25,583570,583570 ... If you could point me to some info on why it's happening, I'd be grateful.
I'm not able to upload the NDB error report, since the cluster is not crashing anymore, and I no longer have the one I uploaded when submitting this bug report. Crashes happened only under high load; otherwise everything ran fine (unless we tried to run "alter table" on a temporary InnoDB table, which I believe is a different bug, causing the whole cluster to crash).
If you have any more questions, please let me know. Thanks.
[11 Apr 2013 16:58]
Shahryar Ghazi
Hi Mateusz, Can you upload the cluster log and configuration files, and the MySQL error log and configuration files, please? Please make sure you upload log files containing errors/warnings. Thanks.
[19 May 2013 1:00]
Bugs System
No feedback was provided for this bug for over a month, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".
[24 May 2013 21:05]
Shahryar Ghazi
Are you using NIC bonding? If yes, what bonding mode are you using?