Bug #94709 | Regression behavior for full text index | ||
---|---|---|---|
Submitted: | 19 Mar 2019 19:41 | Modified: | 20 Mar 2019 7:57 |
Reporter: | Carlos Tutte | Email Updates: | |
Status: | Verified | Impact on me: | |
Category: | MySQL Server: FULLTEXT search | Severity: | S3 (Non-critical) |
Version: | 5.7.21,5.7.25, 8.0.15, 5.6.43 | OS: | Any |
Assigned to: | CPU Architecture: | Any | |
Tags: | regression |
[19 Mar 2019 19:41]
Carlos Tutte
[20 Mar 2019 7:57]
MySQL Verification Team
Hello Carlos Tutte, Thank you for the report and test case. Thanks, Umesh
[20 Mar 2019 8:00]
Nikolai Ikhalainen
Hi, 5.7.21+ contains the fix: InnoDB: Full-text search on indexed columns that use a binary collation did not return case-sensitive matches. (Bug #21625016, Bug #78048) SET GLOBAL innodb_ft_aux_table='test/posts_translations_copy2'; mysql> select * from INFORMATION_SCHEMA.INNODB_FT_INDEX_CACHE where word like '%ring'; | WORD | FIRST_DOC_ID | LAST_DOC_ID | DOC_COUNT | DOC_ID | POSITION | +--------+--------------+-------------+-----------+--------+----------+ | Spring | 2 | 2 | 1 | 2 | 8 | In 5.7.20: | WORD | FIRST_DOC_ID | LAST_DOC_ID | DOC_COUNT | DOC_ID | POSITION | +--------+--------------+-------------+-----------+--------+----------+ | spring | 2 | 2 | 1 | 2 | 8 | After alter table posts_translations_copy2 modify column content mediumtext CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci; | WORD | FIRST_DOC_ID | LAST_DOC_ID | DOC_COUNT | DOC_ID | POSITION | +--------+--------------+-------------+-----------+--------+----------+ | spring | 2 | 2 | 1 | 2 | 8 | Even after set names utf8mb4 COLLATE utf8mb4_bin; It looks like a full text search in boolean mode for double quotes is trying to find lowercase phrase even if we are providing a correct case in AGAINST.