Bug #1867 UTF-8
Submitted: 18 Nov 2003 4:50 Modified: 26 Nov 2003 11:59
Reporter: Joel Jordan Email Updates:
Status: Closed Impact on me:
None 
Category:MySQL Server: MyISAM storage engine Severity:S2 (Serious)
Version:4.1.0-alpha OS:Linux (suse 7.2)
Assigned to: Sergei Golubchik CPU Architecture:Any

[18 Nov 2003 4:50] Joel Jordan
Description:
mysql  Ver 13.5 Distrib 4.1.0-alpha, for pc-linux (i686)

There seems to be a problem using UTF-8 (which is supposed to be supported with match/against), in that I have rows containing various words, but only some of them are successfully matched against, eg:

abârîg cannot be matched against
nor can gçtîg nor dçw
however, zarduðt can be, and so can pursîd , mânthr and hç
so... I wonder what's wrong.

Could it be with certain positioning of the long e?  (i.e. e with a straight line over it, and coded as %C4%93 )

How to repeat:
select * from myTextTable where match (textdata) against ('wordAsAbove')
[18 Nov 2003 5:37] Sergei Golubchik
match ... against is supposed to support utf-8 in 4.1.1, not in 4.1.0.