Bug #47805 inproper UTF8 handling TEXT, LONGTEXT ... other TEXT types
Submitted: 3 Oct 2009 14:07 Modified: 3 Nov 2009 15:57
Reporter: Peter Solti Email Updates:
Status: No Feedback Impact on me:
None 
Category:MySQL Server: Charsets Severity:S2 (Serious)
Version:5.0.27-community-nt OS:Windows
Assigned to: CPU Architecture:Any
Tags: TEXT type, utf8

[3 Oct 2009 14:07] Peter Solti
Description:
I created a webpage, which accesses a MySQL database. Server IIS, language ASP.
Created a test Table with 4 fields:
id
text_1  text
text_2  varchar(100)

Conditions:
- default charset and collation: UTF8, UTF8_general_ci, TEXT type has own UTF8 and UTF8_general_ci collation settings.
- the html form control for each field is TEXTAREA.

Inserted special Hungarian characters into both field types and the result:

In case of TEXT fields, the storing and retrieving shows the followings:
- é -> OK.
- á -> OK.
- ő -> UTF C5 91 incorrectly stored: C3 B5
- ű -> UTF C5 B1 incorrectly stored: C3 BB

In case of VARCHAR fields, the storing and retrieving is done OK, without miscoding.

How to repeat:
Repeat as described above.
[3 Oct 2009 14:26] Peter Solti
The storing is done through ODBC 5.1
[3 Oct 2009 15:57] Valeriy Kravchuk
Thank you for the problem report. Please, try to repeat with a newer version, 5.0.86, and inform about the results.
[4 Nov 2009 0:01] Bugs System
No feedback was provided for this bug for over a month, so it is
being suspended automatically. If you are able to provide the
information that was originally requested, please do so and change
the status of the bug back to "Open".