Bug #11620 | Bug #5497 test case in func_compress.test causes cluster to core | ||
---|---|---|---|
Submitted: | 28 Jun 2005 20:36 | Modified: | 22 Aug 2005 7:36 |
Reporter: | Jonathan Miller | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S2 (Serious) |
Version: | 5.1.0-wl2325-wl1354-new | OS: | Linux (Linux) |
Assigned to: | Pekka Nousiainen | CPU Architecture: | Any |
[28 Jun 2005 20:36]
Jonathan Miller
[28 Jun 2005 20:37]
Jonathan Miller
ndb_2_error
Attachment: ndb_2_error.log (text/x-log), 867 bytes.
[28 Jun 2005 20:38]
Jonathan Miller
ndb 2 trace 1
Attachment: ndb_2_trace.log.1 (application/octet-stream, text), 8.24 KiB.
[28 Jun 2005 20:38]
Jonathan Miller
ndb 2 trace 2
Attachment: ndb_2_trace.log.2 (application/octet-stream, text), 8.24 KiB.
[28 Jun 2005 20:38]
Jonathan Miller
ndb_3_error
Attachment: ndb_3_error.log (text/x-log), 867 bytes.
[28 Jun 2005 20:39]
Jonathan Miller
ndb 3 trace 1
Attachment: ndb_3_trace.log.1 (application/octet-stream, text), 8.24 KiB.
[28 Jun 2005 20:39]
Jonathan Miller
ndb 3 trace 2
Attachment: ndb_3_trace.log.2 (application/octet-stream, text), 8.24 KiB.
[6 Jul 2005 9:17]
Pekka Nousiainen
The query (a test of mysqld max memory?) does not contact NDB. It seems that mysqld used up all resources (memory, CPU), so the db nodes ran too slowly. This caused the "watchdog" thread to believe ndbd was stuck in a loop, and it terminated ndbd.
[6 Jul 2005 13:44]
Jonathan Miller
shouldn't the cluster know that we have a resources and react correctly to it. Maybe putting an error in the log that we are out of resources and then shutting down gracefully?
[6 Jul 2005 13:58]
Jonathan Miller
Should have stated: shouldn't the cluster know that we have a resources issue and react correctly to it? Maybe by putting an error in the log that we are out of resources and then shutting down gracefully?
[6 Jul 2005 14:18]
Pekka Nousiainen
> shouldn't the cluster know that we have a resources issue

It just detects that ndbd is "stuck" (does not process new signals, etc). It could be for any reason. The action is to shut down the node so that it can be restarted.
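To illustrate why a watchdog of this kind cannot tell a starved process from one caught in an infinite loop, here is a minimal Python sketch of a heartbeat-based check. It is not the ndbd implementation; the `Watchdog` class, its `timeout_s` parameter, and the injectable `clock` are hypothetical names chosen for the example.

```python
import time


class Watchdog:
    """Flags a worker as stuck when its heartbeat counter stops advancing.

    Hypothetical illustration only; this is not how ndbd is written.
    """

    def __init__(self, timeout_s, clock=time.monotonic):
        self.timeout_s = timeout_s
        self.clock = clock          # injectable so the example is deterministic
        self.counter = 0            # bumped by the worker on every loop pass
        self._last_seen = 0         # counter value at the previous check
        self._last_progress = clock()

    def heartbeat(self):
        """Called by the worker each time it processes a batch of signals."""
        self.counter += 1

    def check(self):
        """Return True if no heartbeat arrived for at least timeout_s seconds."""
        now = self.clock()
        if self.counter != self._last_seen:
            # The worker made progress since the last check.
            self._last_seen = self.counter
            self._last_progress = now
            return False
        # No progress. The cause is unknown: an endless loop, swapping,
        # or CPU starvation by another process all look the same here.
        return now - self._last_progress >= self.timeout_s


# Usage with a fake clock so the outcome does not depend on real time.
fake_now = [0.0]
wd = Watchdog(timeout_s=5.0, clock=lambda: fake_now[0])
wd.heartbeat()
fake_now[0] = 1.0
print(wd.check())  # False: a heartbeat was seen
fake_now[0] = 7.0
print(wd.check())  # True: 6 seconds without a heartbeat
```

The check only observes the absence of progress, which is why the sole available reaction is to stop the node so that it can be restarted.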
[6 Jul 2005 14:26]
Jonathan Miller
My point is that the cluster should be aware of system resources, know when we are running out, and be proactive rather than reactive.
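As a rough sketch of what such a proactive check could look like (not something the cluster is known to do), the Python below parses text in the format of Linux `/proc/meminfo` and reports when available memory falls under a threshold. The function names and the 5% default are assumptions for illustration, and the `MemAvailable` field only exists on Linux kernels 3.14 and later.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a {field: value_in_kB} dict."""
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        # Keep only well-formed "Name:   12345 kB" style lines.
        if sep and parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def memory_is_low(meminfo, min_available_fraction=0.05):
    """Return True when available memory is below the given fraction of total.

    The threshold is a hypothetical default, not a recommended value.
    """
    total = meminfo["MemTotal"]
    available = meminfo["MemAvailable"]
    return available < total * min_available_fraction


# Usage with a hand-written sample instead of the live /proc/meminfo.
sample = "MemTotal:       1000000 kB\nMemAvailable:     30000 kB\n"
if memory_is_low(parse_meminfo(sample)):
    print("warning: low on memory, consider a graceful shutdown")
```

A process could run a check like this periodically and log a warning before the watchdog timeout is ever reached.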