Bug #97569 | Connector is inserting Japanese as gibberish when using utf8 charset | ||
---|---|---|---|
Submitted: | 10 Nov 2019 7:00 | Modified: | 11 Nov 2019 5:49 |
Reporter: | Chris M | Email Updates: | |
Status: | Not a Bug | Impact on me: | |
Category: | Connector / C | Severity: | S3 (Non-critical) |
Version: | 8.0.18 | OS: | Ubuntu (16.04) |
Assigned to: | MySQL Verification Team | CPU Architecture: | x86 |
Tags: | jibberish, truncate, utf8, utf8mb4 |
[10 Nov 2019 7:00]
Chris M
[10 Nov 2019 14:17]
MySQL Verification Team
Thank you for the bug report. Please provide the complete test case (the C client file attaching it with Files tab). Thanks.
[10 Nov 2019 21:05]
Chris M
Example program - please read the comments at the top
Attachment: bugtest.c (application/octet-stream, text), 2.82 KiB.
[10 Nov 2019 21:05]
Chris M
textfile that the test program uses, contains english and japanese text
Attachment: textfile (application/octet-stream, text), 35 bytes.
[11 Nov 2019 5:49]
Ryusuke Kajiyama
Attached textfile is encoded in Shift JIS, not UTF-8. Using MySQL charset cp932 is suggested. Or, convert text data into UTF-8 and continue to use utf8mb4 charset in MySQL.