Bug #9406 | Arabic letter (Feh, U+0641) gets corrupted when entered in a table | ||
---|---|---|---|
Submitted: | 25 Mar 2005 21:06 | Modified: | 25 Apr 2005 18:07 |
Reporter: | Martine Petrod | Email Updates: | |
Status: | Not a Bug | Impact on me: | |
Category: | MySQL Server | Severity: | S2 (Serious) |
Version: | 4.1.7 & 4.1.10a | OS: | Windows (Windows XP) |
Assigned to: | CPU Architecture: | Any |
[25 Mar 2005 21:06]
Martine Petrod
[25 Mar 2005 21:13]
Martine Petrod
Script
Attachment: TestA.sql (text/plain), 1.44 KiB.
[25 Mar 2005 21:13]
Martine Petrod
Output file
Attachment: outfile.txt (application/octet-stream, text), 1.22 KiB.
[25 Mar 2005 21:16]
Martine Petrod
Original source script and output files are attached as utf8 encoded text files in order to preserve the Arabic letters.
[25 Mar 2005 21:18]
Martine Petrod
Category: also MyISAM Table Handler
[26 Mar 2005 2:27]
Heikki Tuuri
Hi! My guess is that you have set the client character set or the mysql client character set wrong in your my.cnf. By default, the mysql interactive SQL client uses the MS-DOS character set. Please test with different client charsets in my.cnf. Regards, Heikki
[26 Mar 2005 10:56]
Martine Petrod
Hi! Thank you very much - You were right and I just had to force utf8 as client character set in my.ini as described at the end of 5.8.1. to make it work. By the way I'm glad to see that bug #7571 (The output of the STATUS (\s) command in mysql had the values for the server and client character sets reversed) is being fixed: that had me confused for a while...
[18 Apr 2005 20:29]
Allen Jensen
Sounds like bug ID 8730. Have seen a couple of these in the database. Still seeing problem in 4.1.10 Unicode character U+30CD converted to UTF8 is (hex) E3 83 8D Store this in a utf8 column (load table infile). Select or any other means of retrevial shows corrupted data. %_set_% all defaulted - show as latin1 except system which shows as utf8. Did a SET NAMES 'utf8'; just before the load.