MySQL Bugs: #20646: Node corruption after REDO logs overloaded eror

Bug #20646	Node corruption after REDO logs overloaded eror
Submitted:	23 Jun 2006 3:04	Modified:	23 Jul 2006 6:28
Reporter:	Lachlan Mulcahy	Email Updates:
Status:	No Feedback	Impact on me:	None
Category:	MySQL Cluster: Cluster (NDB) storage engine	Severity:	S2 (Serious)
Version:	5.0.21 cluster	OS:	Linux (Linux)
Assigned to:		CPU Architecture:	Any

Description:
After receiving a "Got temporary error 410 'REDO log files overloaded, consult online manual" error, nodes were corrupted:

Node 3: Forced node shutdown completed. Occured during startphase 4. Initiated by signal 0. Caused by error 2815: 'File not found(Ndbd file system inconsistency error, please report a bug). Ndbd file system error, restart node initial'.
Node 2: Forced node shutdown completed. Occured during startphase 4. Initiated by signal 0. Caused by error 2815: 'File not found(Ndbd file system inconsistency error, please report a bug). Ndbd file system error, restart node initial'.

How to repeat:
Possibly increase TimeBetweenLocalCheckpoints and/or decrease NoOfFragmentLogFiles until your transaction load overloads the REDO logs.

Suggested fix:
n/a

would be good to get the logs from the cluster.

this can be fetched with (i think it's included in that release) ndb_error_reporter or by manually copying ndb_*log and ndb*trace*

Did they change the value on "NoOfFragmentLogfiles". if they did, then this is not a bug.

After changing that value, you need to perform a rolling restart --initial
A system restart will not work.

But they should be able to change it back, perfrom a system restart.
And the change it using rolling restart --initial

No feedback was provided for this bug for over a month, so it is
being suspended automatically. If you are able to provide the
information that was originally requested, please do so and change
the status of the bug back to "Open".