Bug #113137 | MySQL Innodb cluster hang forever | ||
---|---|---|---|
Submitted: | 20 Nov 2023 4:04 | Modified: | 27 Nov 2023 3:07 |
Reporter: | zetang zeng (OCA) | Email Updates: | |
Status: | Verified | Impact on me: | |
Category: | MySQL Server: Group Replication | Severity: | S3 (Non-critical) |
Version: | 5.7.43 | OS: | Linux |
Assigned to: | CPU Architecture: | Any |
[20 Nov 2023 4:04]
zetang zeng
[21 Nov 2023 9:06]
zetang zeng
Is the `stop group_replication` thread waiting for `xcom_taskmain_startup` thread? It seems because there is inconsistency in xcom task loop (active_tasks is 31, but task link list is empty): ``` (gdb) p active_tasks $2 = 31 (gdb) p &tasks $3 = (linkage *) 0x7f2b81e580a0 <tasks> (gdb) p tasks $4 = {type = 0, suc = 0x7f2b81e580a0 <tasks>, pred = 0x7f2b81e580a0 <tasks>} ```
[23 Nov 2023 21:36]
MySQL Verification Team
Hi, First, I dropped the severity to S3 as this is not a S2 issue. Secondly, while I can reproduce this issue doing exactly what you stated I do not see how is this "regular thing that can happen in real life" (hence it cannot be S2) I am not sure this is a bug as this is not a normal situation. If I properly kill the network (remove eth cable for e.g.) this issue will not reproduce. Anyhow I will verify the report and let GR team decide if they think this is a bug or there is something they can improve upon. Thank you for the report
[27 Nov 2023 3:04]
zetang zeng
Yep, I agree with you that this reproducing case is too rare to happen in real life. But we do met some similar situation in this case(https://sourceware.org/bugzilla/show_bug.cgi?id=30977), which we fail to reproduce. Hope the reason (inconsistency in GCS task?) leads to this problem also the root cause of this one(https://sourceware.org/bugzilla/show_bug.cgi?id=30977)
[27 Nov 2023 3:07]
zetang zeng
Oh sorry, I give the wrong link in last msg https://bugs.mysql.com/bug.php?id=112277&thanks=5¬ify=71