Bug #561 utf-8 charset ,ossomg
Submitted: 31 May 2003 23:53 Modified: 1 Jun 2003 6:42
Reporter: Robert Mark Bram Email Updates:
Status: Duplicate Impact on me:
None 
Category:MySQL Server: Packaging Severity:S2 (Serious)
Version:Ver 13.5 Distrib 4.1.0-alpha OS:Windows (Windows XP)
Assigned to: CPU Architecture:Any

[31 May 2003 23:53] Robert Mark Bram
Description:
The documentation for this version says that two forms of 
Unicode are supported - ucs2 and utf8. However, the xml 
charset files for these character sets were not 
included in the packaging. 

In my distribution the charset directory is:
C:\mysql\share\charsets

There are 25 files in this directory such as latin1.xml, hebrew.xml 
etc but none for Unicode/ucs2/utf8. 

This means that when I try to execute a CREATE TABLE statement that 
includes a unicode charset I get an error message.

How to repeat:
mysql> CREATE TABLE `language_a` (
    ->      `id`         int (11) NOT NULL auto_increment,
    ->      `tagType`    varchar (50) default 'text',
    ->      `text`       LONGTEXT CHARACTER SET 'utf-8' NOT NULL,
    ->      PRIMARY KEY  (`id`),
    ->      UNIQUE KEY `id` (`id`)
    ->    ) TYPE=InnoDB
    -> ;
ERROR 1115: Unknown character set: 'utf-8'
mysql> exit
Bye

Suggested fix:
Include the unicode charsets in the mysql package!
[1 Jun 2003 4:26] MySQL Verification Team
Thanks you for the bug report but this issue was already reported by
the bug report #312.
The fix will be added in the next release 4.1.1.
[1 Jun 2003 5:36] Robert Mark Bram
Thanks for letting me know - I look forward to the next version. :)
In the meantime, can we download the utf8 charset anywhere?

Thanks!
Rob
:)
[1 Jun 2003 6:42] Mark Matthews
'UTF-8' is a 'compiled' character set. It does not use the config file, it can only be enabled when compiling the binary.
[23 May 2005 22:27] James Tomek
Hi, I am not sure if I understand it right. I just installed version 4.1.12-nt. When I look at the \share\charsets directory, utf8 is not there. Is it missing again or when you said "'UTF-8' is a 'compiled' character set. It does not use the config file, it can only be enabled when compiling the binary." it means that the char set is somehow inside the program and therefore does not appear in this directory at all and never will? Thanks.