Description:
I got some problems when I try to restart the ndb nodes.
In My Cluster configuration, there are 12 ndb nodes in total. I use command "ndbd -n" to start all the ndb nodes respectively, after that I use "all start" in management client to actually start all ndb nodes.
After a while, in node start up phase 4, I got the following error in the management client:
######################################
Node 25: Forced node shutdown completed. Occured during startphase 4. Caused by error 32782: 'No message slogan found (please report a bug if you get this error code)(Unknown). Unknown'.
Node 15: Forced node shutdown completed. Occured during startphase 4. Caused by error 32782: 'No message slogan found (please report a bug if you get this error code)(Unknown). Unknown'.
Node 22: Forced node shutdown completed. Occured during startphase 4. Caused by error 2308: 'Another node failed during system restart, please investigate error(s) on other node(s)(Restart error). Temporary error, restart node'.
Node 17: Forced node shutdown completed. Occured during startphase 4. Caused by error 2308: 'Another node failed during system restart, please investigate error(s) on other node(s)(Restart error). Temporary error, restart node'.
Node 27: Forced node shutdown completed. Occured during startphase 4. Caused by error 2308: 'Another node failed during system restart, please investigate error(s) on other node(s)(Restart error). Temporary error, restart node'.
Node 14: Forced node shutdown completed. Occured during startphase 4. Caused by error 2308: 'Another node failed during system restart, please investigate error(s) on other node(s)(Restart error). Temporary error, restart node'.
Node 24: Forced node shutdown completed. Occured during startphase 4. Caused by error 2308: 'Another node failed during system restart, please investigate error(s) on other node(s)(Restart error). Temporary error, restart node'.
...
###################
I realized it could be some problems with node 15 and node 25, then I checked the error log in node 15:
###################################
The following is the content of file "ndb_15_error.log"
Time: Monday 17 August 2009 - 22:20:07
Status: Unknown
Message: No message slogan found (please report a bug if you get this error code) (Unknown)
Error: 32782
Error data: DBLQH: File system open failed. OS errno: 4294967295
Error object: DBLQH (Line: 3073) 0x0000000a
Program: /usr/local/mysql/mysql-cluster/bin/ndbd
Pid: 15036
Trace: /var/lib/ndb/ndb-15/ndb_15_trace.log.6
Version: mysql-5.1.34 ndb-7.0.6
######################################
This is part of the content from ndb_15_trace.log.6
###########################
--------------- Signal ----------------
r.bn: 246 "DBDIH", r.proc: 15, r.sigId: 1988714 gsn: 238 "DISEIZEREQ" prio: 1
s.bn: 245 "DBTC", s.proc: 15, s.sigId: 1988713 length: 2 trace: 1 #sec: 0 fragInf: 0
H'0009e47c H'00f5000f
--------------- Signal ----------------
r.bn: 245 "DBTC", r.proc: 15, r.sigId: 1988713 gsn: 236 "DISEIZECONF" prio: 1
s.bn: 246 "DBDIH", s.proc: 15, s.sigId: 1988712 length: 2 trace: 1 #sec: 0 fragInf: 0
H'0009e47b H'0004eeeb
--------------- Signal ----------------
r.bn: 246 "DBDIH", r.proc: 15, r.sigId: 1988712 gsn: 238 "DISEIZEREQ" prio: 1
s.bn: 245 "DBTC", s.proc: 15, s.sigId: 1988711 length: 2 trace: 1 #sec: 0 fragInf: 0
H'0009e47b H'00f5000f
--------------- Signal ----------------
r.bn: 245 "DBTC", r.proc: 15, r.sigId: 1988711 gsn: 236 "DISEIZECONF" prio: 1
s.bn: 246 "DBDIH", s.proc: 15, s.sigId: 1988710 length: 2 trace: 1 #sec: 0 fragInf: 0
H'0009e47a H'0004eeea
##################################
It doesn't make sense to me anyway.
So any ideas about why I got this problem? I tried few times and got the same errors.
How to repeat:
In My Cluster configuration, there are 12 ndb nodes in total.
I use command "ndbd -n" to start all the ndb nodes respectively, after that I use "all start" in management client to actually start all ndb nodes.
During the ndb node starting up, I got some errors, see description section for details.