Bug #19328 | Slave timeout with COM_REGISTER_SLAVE error causing stop | ||
---|---|---|---|
Submitted: | 25 Apr 2006 9:00 | Modified: | 22 Jul 2007 18:02 |
Reporter: | Mats Kindahl | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Server: Replication | Severity: | S1 (Critical) |
Version: | 5.1 source, 5.0 | OS: | Any |
Assigned to: | Ramil Kalimullin | CPU Architecture: | Any |
[25 Apr 2006 9:00]
Mats Kindahl
[25 Apr 2006 12:37]
Valeriy Kravchuk
Please, specify the exact tests to run. What version(s) of MySQL server should I check? 5.0.x, 5.1.x?
[25 Apr 2006 13:00]
Mats Kindahl
The fault occurs most frequently when executing cluster replication tests, e.g., rpl_ndb_dd_partitions, since they have a tendency to load the server. I haven't seen it for any other tests, but in principle it can happen for any test, since there is nothing specific with cluster in this aspect.
[25 Apr 2006 21:07]
Elliot Murphy
The cluster team is also hitting this bug, and it shows up sometimes in pushbuild. This needs to be fixed because it is interrupting our ability to fix other bugs.
[4 May 2006 21:21]
Jonathan Miller
http://bugs.mysql.com/?id=19471
[3 Jul 2006 15:16]
Augusto Bott
Just hit this very same problem today. Dropped a database with 50+ tables (all were InnoDB) on a dual-master setup. The relay thread died with this on the error log: [ERROR] Error on COM_REGISTER_SLAVE: 1159 ''
[6 Jun 2007 10:42]
Andrei Elkin
Happened with 5.1-rpl tree after https://intranet.mysql.com/secure/pushbuild/getlog.pl?dir=mysql-5.1-new-rpl&entry=skozlov@... CURRENT_TEST: rpl_server_id2 070606 0:54:55 [Warning] The syntax for replication startup options is deprecated and will be removed in MySQL 5.2. Please use 'CHANGE MASTER' instead. 070606 0:54:55 [Note] Plugin 'InnoDB' disabled by command line option 070606 0:54:55 [Note] Event Scheduler: Loaded 0 events 070606 0:54:55 [Note] /data0/pushbuild/pb/mysql-5.1-new-rpl/277/mysql-5.1-new-rpl-exp/sql/mysqld: ready for connections. Version: '5.1.20-beta-pb277-debug-log' socket: '/dev/shm/pbtmp-ps_row-102/slave.sock' port: 11022 Source distribution 070606 0:54:55 [Note] Slave SQL thread initialized, starting replication in log 'FIRST' at position 0, relay log '/dev/shm/var-ps_row-102/log/slave-relay-bin.000001' position: 4 070606 0:54:55 [Note] next log '/dev/shm/var-ps_row-102/log/slave-relay-bin.000002' is currently active 070606 0:54:55 [Note] Slave I/O thread: connected to master 'root@127.0.0.1:11020',replication started in log 'FIRST' at position 4 070606 0:54:55 [ERROR] Error on COM_REGISTER_SLAVE: 1159 '' 0
[11 Jun 2007 9:31]
Lars Thalmann
See also BUG#22989
[26 Jun 2007 11:37]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/29577 ChangeSet@1.2509, 2007-06-26 13:32:30+05:00, ramil@mysql.com +1 -0 Fix for bug #19328: Slave timeout with COM_REGISTER_SLAVE error causing stop Problem: "Under high load, the slave registering to the master can timeout during the COM_REGISTER_SLAVE execution. This causes an error, which prevents the slave from connecting at all." Fix: Do not abort immediately, but retry registering on master.
[28 Jun 2007 7:39]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/29806 ChangeSet@1.2524, 2007-06-28 12:38:43+05:00, ramil@mysql.com +1 -0 Fix for bug #19328: Slave timeout with COM_REGISTER_SLAVE error causing stop Problem: "Under high load, the slave registering to the master can timeout during the COM_REGISTER_SLAVE execution. This causes an error, which prevents the slave from connecting at all." Fix: Do not abort immediately, but retry registering on master.
[28 Jun 2007 12:18]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/29847 ChangeSet@1.2524, 2007-06-28 17:18:29+05:00, ramil@mysql.com +1 -0 Fix for bug #19328: Slave timeout with COM_REGISTER_SLAVE error causing stop Problem: "Under high load, the slave registering to the master can timeout during the COM_REGISTER_SLAVE execution. This causes an error, which prevents the slave from connecting at all." Fix: Do not abort immediately, but retry registering on master.
[29 Jun 2007 16:48]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/29968 ChangeSet@1.2524, 2007-06-29 21:48:22+05:00, ramil@mysql.com +1 -0 Fix for bug #19328: Slave timeout with COM_REGISTER_SLAVE error causing stop Problem: "Under high load, the slave registering to the master can timeout during the COM_REGISTER_SLAVE execution. This causes an error, which prevents the slave from connecting at all." Fix: Do not abort immediately, but retry registering on master.
[10 Jul 2007 13:26]
Bugs System
Pushed into 5.1.21-beta
[22 Jul 2007 18:02]
Paul DuBois
Noted in 5.1.21 changelog. If a slave timed out while registering with the master to which it was connecting, auto-reconnect failed thereafter.