Bug #30366 | NDB fails to start on OS X, 64 bit | ||
---|---|---|---|
Submitted: | 10 Aug 2007 19:33 | Modified: | 17 Jan 2008 22:34 |
Reporter: | Joerg Bruehe | Email Updates: | |
Status: | Closed | Impact on me: | |
Category: | MySQL Cluster: Cluster (NDB) storage engine | Severity: | S2 (Serious) |
Version: | 5.1 | OS: | MacOS (64bit) |
Assigned to: | Magnus Blåudd | CPU Architecture: | Any |
Tags: | sr5_1 |
[10 Aug 2007 19:33]
Joerg Bruehe
[13 Aug 2007 7:36]
Stewart Smith
Hi Joerg! Could you please:
- check the output of 'ndb_mgm -e "show"' when it "hangs" (pass -c for connectstring for test or set NDB_CONNECTSTRING env variable)
- check (and attach) the cluster log as well as logs for mgm server and data nodes (basically *.log in the ndbcluster directory)
This should help in tracking it down. I gather we don't have a host like this in pb running this sort of build regularly.... :(
[13 Aug 2007 8:01]
Joerg Bruehe
I will try to do as requested, but I have to repeat: This happens while automated builds and tests are running, so in general we have little chance for manual intervention and analysis. Currently, the "classic" build is nearly done - if "advanced" gets into this hang, I can try as requested; if not, the saved tree must be used to reproduce the bug.
[13 Sep 2007 23:00]
Bugs System
No feedback was provided for this bug for over a month, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".
[22 Sep 2007 14:13]
Joerg Bruehe
Bug was reproduced in a 5.1.22-rc build, and shown to Cluster support (Stewart).
[29 Oct 2007 16:41]
Magnus Blåudd
The mgm client can't connect properly:
osx-tiger-ppc:~/magnus/mysql-5.1.23-beta-pb1577/mysql-test mysqldev$ ../storage/ndb/src/mgmclient/ndb_mgm --ndb-connectstring=host=localhost:10175 -e "show"
Connected to Management Server at: localhost:10175
[29 Oct 2007 16:54]
Magnus Blåudd
Repeatable with a 64-bit debug compile on osx-tiger-ppc. The ndb_mgmd starts and sets up the listening socket. It does not seem to respond when you connect to it with ndb_mgm, but telnet works. See below.
osx-tiger-ppc:~/magnus/mysql-5.1.23-beta-pb1577/mysql-test mysqldev$ telnet localhost 10175
Connected to localhost.
Escape character is '^]'.
get version
version
id: 327959
major: 5
minor: 1
string: Version 5.1.23 (beta)
[29 Oct 2007 16:55]
Magnus Blåudd
But telnet + "get status" hangs half way through.
get status
node status
nodes: 11
node.1.type: NDB << hangs here
[29 Oct 2007 20:34]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/36601 ChangeSet@1.2569, 2007-10-29 21:33:30+01:00, msvensson@pilot.mysql.com +1 -0 Bug#30366 NDB fails to start on OS X, PPC, 64 bit - The errno variable should only be used when the previous socket write failed, it should be regarded as undefined at other times
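The rule stated in the changeset comment can be illustrated with a minimal sketch. This is not the actual OutputStream.cpp code; `WriteResult` and `write_checked` are hypothetical names chosen for the example, and treating `EAGAIN` or `ETIMEDOUT` as a timeout is an assumption. The point it shows is that `errno` is read only on the failure path of `write()`, so a stale value left by an earlier call cannot be mistaken for a timeout.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <unistd.h>

// Hypothetical result type for this sketch; not part of the NDB sources.
struct WriteResult {
  bool ok;         // true when the write call itself succeeded
  bool timed_out;  // only ever set after a failed write
};

// Hypothetical helper: performs one write and classifies the outcome.
WriteResult write_checked(int fd, const char* buf, std::size_t len) {
  assert(buf != nullptr);
  const ssize_t n = ::write(fd, buf, len);
  if (n < 0) {
    // errno is meaningful only here, directly after the failing call.
    // Assumption: EAGAIN and ETIMEDOUT are the codes treated as a timeout.
    const bool timeout = (errno == EAGAIN || errno == ETIMEDOUT);
    return {false, timeout};
  }
  // Success: whatever errno holds is left over from some earlier call,
  // so it is deliberately not inspected.
  return {true, false};
}
```

With this shape, a caller that sets `errno` to `ETIMEDOUT` before a successful write still gets `timed_out == false`, which is the behavior the changeset comment asks for.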
[29 Oct 2007 20:36]
Magnus Blåudd
The client was now hanging half way through the response. It would probably be better if the server closed the connection when a timeout has occurred.
[29 Oct 2007 20:41]
Magnus Blåudd
Something like this, but preferably for all our users of SocketServer.
msvensson@pilot:~/mysql/my51-ndb-bug30366/storage/ndb/src/common$ bk -r diffs -u
===== storage/ndb/src/mgmsrv/Services.cpp 1.95 vs edited =====
--- 1.95/storage/ndb/src/mgmsrv/Services.cpp 2007-07-11 14:36:40 +02:00
+++ edited/storage/ndb/src/mgmsrv/Services.cpp 2007-10-29 21:40:11 +01:00
@@ -349,6 +349,10 @@ MgmApiSession::runSession()
     m_parser->run(ctx, *this);
+    if (m_output->timedout() ||
+        m_input->timedout())
+      m_stop= true;
+
     if(ctx.m_currentToken == 0)
     {
       NdbMutex_Unlock(m_mutex);
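The effect of the proposed check can be sketched in isolation. The types below are hypothetical stand-ins, not the real MgmApiSession or stream classes; only the names `timedout()` and `m_stop` are taken from the diff. The loop handles commands until either stream reports a timeout, then stops so the connection can be closed instead of leaving the client waiting.

```cpp
#include <cassert>

// Hypothetical stand-in for the input/output stream classes.
struct Stream {
  bool m_timedout = false;
  bool timedout() const { return m_timedout; }
};

// Hypothetical stand-in for the session object.
struct Session {
  Stream in;
  Stream out;
  bool m_stop = false;
};

// Runs the session loop and returns how many commands were handled.
// timeout_on_command (1-based) simulates which command's reply times out.
int run_session(Session& s, int timeout_on_command) {
  assert(timeout_on_command > 0);  // guarantees the loop terminates
  int handled = 0;
  while (!s.m_stop) {
    ++handled;  // stands in for m_parser->run(ctx, *this)
    if (handled == timeout_on_command)
      s.out.m_timedout = true;  // simulate a write that timed out
    // The check from the diff: stop the session once a stream timed out.
    if (s.out.timedout() || s.in.timedout())
      s.m_stop = true;
  }
  return handled;
}
```

Calling `run_session(s, 3)` on a fresh `Session` handles three commands and leaves `s.m_stop` set, so no further command is parsed on a connection whose stream has already timed out.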
[26 Nov 2007 17:54]
Magnus Blåudd
Pushed to mysql-5.1-ndb
[4 Dec 2007 8:08]
Mattias Jonsson
I can verify this on an Intel MacBook with Mac OS X 10.5.1 (uname -a: Darwin witty 9.1.0 Darwin Kernel Version 9.1.0: Wed Oct 31 17:46:22 PDT 2007; root:xnu-1228.0.2~1/RELEASE_I386 i386). The patch works, so now I can finally start the full test suite on my new MacBook!
[10 Dec 2007 23:24]
Omer Barnir
Root Cause Analysis
-------------------
The problem was the result of a change made on March 22, 2007. The resulting behavior differs between platforms, so the problem was observed only on OS X. From a testing point of view, once packaged verification is in place, similar problems will be caught.
[15 Jan 2008 14:00]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/41012 ChangeSet@1.2652, 2008-01-15 15:01:21+01:00, msvensson@pilot.mysql.com +1 -0 Bug#30366 NDB fails to start on OS X, PPC, 64 bit - The errno variable should only be used when the previous socket write failed, it should be regarded as undefined at other times OutputStream.cpp: Only use "errno" after the attempt to write to the socket has failed
[16 Jan 2008 16:03]
Magnus Blåudd
Pushed to mysql-5.1-release
[17 Jan 2008 22:34]
Jon Stephens
Documented bugfix in 5.1.23 changelog.
[24 Jan 2008 11:02]
Bugs System
A patch for this bug has been committed. After review, it may be pushed to the relevant source trees for release in the next version. You can access the patch from: http://lists.mysql.com/commits/41198 ChangeSet@1.2657, 2008-01-24 12:06:40+01:00, tomas@whalegate.ndb.mysql.com +1 -0 Bug#30366 (recommit) NDB fails to start on OS X, PPC, 64 bit - The errno variable should only be used when the previous socket write failed, it should be regarded as undefined at other times
[7 Feb 2008 9:51]
Magnus Blåudd
Pushed also to mysql-5.1-ndb, mysql-5.1-telco-6.2, mysql-5.1-telco-6.3 and mysql-5.1-telco-6.4
[20 Feb 2008 16:02]
Bugs System
Pushed into 5.1.24-rc
[20 Feb 2008 16:02]
Bugs System
Pushed into 6.0.5-alpha
[25 Feb 2008 15:58]
Bugs System
Pushed into 5.1.24-rc
[25 Feb 2008 16:04]
Bugs System
Pushed into 6.0.5-alpha
[30 Mar 2008 18:57]
Jon Stephens
Fix also documented for 6.0.5.