Description:
2-node cluster is busy handling a node failure
node is restarted with low partition timeout
and desides to start by itself
cluster comes up partitioned
2006-01-10 14:20:35 [MgmSrvr] ALERT -- Node 3: Node 4 Disconnected
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 3: Communication to Node 4 closed
2006-01-10 14:20:35 [MgmSrvr] ALERT -- Node 3: Network partitioning - arbitration required
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 3: President restarts arbitration thread [state=7]
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 1: Node 4 Connected
2006-01-10 14:20:35 [MgmSrvr] ALERT -- Node 3: Arbitration won - positive reply from node 1
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 3: GCP Take over started
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 3: GCP Take over completed
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 4: Node shutdown completed, restarting.
2006-01-10 14:20:35 [MgmSrvr] WARNING -- Allocate nodeid (4) failed. Connection from ip 172.30.16.130. Returned error string "Id 4 already allocated by an
other node."
2006-01-10 14:20:35 [MgmSrvr] INFO -- Mgmt server state: node id's 2 3 4 7 8 connected but not reserved
2006-01-10 14:20:35 [MgmSrvr] INFO -- Mgmt server state: node id's 1 not connected but reserved
2006-01-10 14:20:35 [MgmSrvr] INFO -- Node 3: Started arbitrator node 1 [ticket=03920004b4799a0b]
2006-01-10 14:20:38 [MgmSrvr] INFO -- Mgmt server state: nodeid 4 reserved for ip 172.30.16.130, m_reserved_nodes 0000000000000672.
2006-01-10 14:20:38 [MgmSrvr] INFO -- Node 1: Node 4 Connected
2006-01-10 14:20:39 [MgmSrvr] INFO -- Mgmt server state: nodeid 4 freed, m_reserved_nodes 0000000000000662.
2006-01-10 14:20:50 [MgmSrvr] INFO -- Node 4: Start phase 1 completed
2006-01-10 14:20:56 [MgmSrvr] INFO -- Node 3: Backup 10 started from node 1
2006-01-10 14:21:10 [MgmSrvr] INFO -- Node 4: Start phase 2 completed (system restart)
2006-01-10 14:21:10 [MgmSrvr] INFO -- Node 4: Start phase 3 completed (system restart)
2006-01-10 14:21:33 [MgmSrvr] WARNING -- Node 3: Failure handling of node 4 has not completed in 1 min. - state = 3
....
2006-01-10 14:21:41 [MgmSrvr] INFO -- Node 3: Communication to Node 4 opened
2006-01-10 14:21:41 [MgmSrvr] INFO -- Node 3: Node 4 Connected
2006-01-10 14:21:41 [MgmSrvr] INFO -- Node 4: Node 3 Connected
2006-01-10 14:21:53 [MgmSrvr] INFO -- Node 4: Start phase 4 completed (system restart)
2006-01-10 14:21:54 [MgmSrvr] INFO -- Node 4: Local checkpoint 120 started. Keep GCI = 172315 oldest restorable GCI = 172391
2006-01-10 14:24:39 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 reserved for ip 172.30.16.1, m_reserved_nodes 0000000000000e62.
2006-01-10 14:24:39 [MgmSrvr] INFO -- Node 3: Node 11 Connected
2006-01-10 14:24:39 [MgmSrvr] INFO -- Node 3: Node 11: API version 5.1.2
2006-01-10 14:24:39 [MgmSrvr] ALERT -- Node 3: Node 11 Disconnected
2006-01-10 14:24:39 [MgmSrvr] INFO -- Node 3: Communication to Node 11 closed
2006-01-10 14:24:41 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 freed, m_reserved_nodes 0000000000000662.
2006-01-10 14:24:42 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 reserved for ip 172.30.16.1, m_reserved_nodes 0000000000000e62.
2006-01-10 14:24:42 [MgmSrvr] INFO -- Node 3: Communication to Node 11 opened
2006-01-10 14:24:42 [MgmSrvr] INFO -- Node 3: Node 11 Connected
2006-01-10 14:24:42 [MgmSrvr] INFO -- Node 3: Node 11: API version 5.1.2
2006-01-10 14:24:43 [MgmSrvr] ALERT -- Node 3: Node 11 Disconnected
2006-01-10 14:24:43 [MgmSrvr] INFO -- Node 3: Communication to Node 11 closed
2006-01-10 14:24:44 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 freed, m_reserved_nodes 0000000000000662.
2006-01-10 14:24:45 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 reserved for ip 172.30.16.1, m_reserved_nodes 0000000000000e62.
2006-01-10 14:24:46 [MgmSrvr] INFO -- Node 3: Communication to Node 11 opened
2006-01-10 14:24:47 [MgmSrvr] INFO -- Node 3: Node 11 Connected
2006-01-10 14:24:47 [MgmSrvr] INFO -- Node 3: Node 11: API version 5.1.2
2006-01-10 14:24:47 [MgmSrvr] ALERT -- Node 3: Node 11 Disconnected
2006-01-10 14:24:47 [MgmSrvr] INFO -- Node 3: Communication to Node 11 closed
2006-01-10 14:24:48 [MgmSrvr] INFO -- Mgmt server state: nodeid 11 freed, m_reserved_nodes 0000000000000662.
2006-01-10 14:24:50 [MgmSrvr] INFO -- Node 3: Communication to Node 11 opened
2006-01-10 14:25:45 [MgmSrvr] INFO -- Node 3: Backup 10 started from node 1 completed
StartGCP: 172493 StopGCP: 172626
#Records: 5342128 #LogRecords: 1428
Data: 768994140 bytes Log: 57416 bytes
2006-01-10 14:25:46 [MgmSrvr] INFO -- Node 4: Start phase 5 completed (system restart)
2006-01-10 14:25:46 [MgmSrvr] INFO -- Node 4: Start phase 6 completed (system restart)
2006-01-10 14:25:46 [MgmSrvr] INFO -- Node 4: President restarts arbitration thread [state=1]
2006-01-10 14:25:46 [MgmSrvr] INFO -- Node 4: Start phase 7 completed (system restart)
How to repeat:
see above