Bug #90730 Outdated charset and collation documentation on manual regarding 8.0 defaults
Submitted: 3 May 2018 8:25 Modified: 19 May 2018 13:31
Reporter: Jaime Crespo (OCA)
Status: Closed
Category: MySQL Server: Documentation Severity: S3 (Non-critical)
Version: 8.0 OS: Any
Assigned to: CPU Architecture: Any

[3 May 2018 8:25] Jaime Crespo
Description:
"For applications that store data using the default MySQL character set and collation (latin1, latin1_swedish_ci), no special configuration should be needed."

"utf8: An alias for utf8mb3." ('utf8' is currently an alias for the character set UTF8MB3, which will be replaced by UTF8MB4 in a future release.)

https://mysqlserverteam.com/mysql-8-0-when-to-use-utf8mb3-over-utf8mb4/

How to repeat:
https://dev.mysql.com/doc/refman/8.0/en/charset-applications.html

Suggested fix:
That page, and maybe even the whole chapter, needs a more general facelift: use utf8mb4 by default, explain how to change to latin1 for compatibility with older setups, and in general stop assuming that latin1_swedish_ci is the default. It should also mention the older defaults, the issues with 3-byte UTF-8, the future change in the meaning of utf8, etc.
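
To illustrate the 3-byte issue mentioned above: utf8mb3 stores at most three bytes per character, so any character whose UTF-8 encoding needs four bytes (emoji, for example) requires utf8mb4. The following is only a rough sketch of that check in plain Python; the function name `fits_in_utf8mb3` is hypothetical, and no MySQL connection is involved.

```python
def fits_in_utf8mb3(text: str) -> bool:
    """Return True if every character encodes to at most 3 bytes in UTF-8.

    Hypothetical helper for illustration only: it mirrors the utf8mb3
    limit of three bytes per character, without talking to a server.
    """
    return all(len(ch.encode("utf-8")) <= 3 for ch in text)


if __name__ == "__main__":
    # Latin and CJK text fit in three bytes per character; the emoji does not.
    for sample in ["naïve", "日本語", "😀"]:
        print(repr(sample), fits_in_utf8mb3(sample))
```

Text for which this returns False would need a utf8mb4 column to be stored intact.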

This may apply to older versions of MySQL as well, but especially to 8.0.

A search may reveal other references in the 8.0 manual. Some of them may be OK, but others state that it is the default: https://dev.mysql.com/doc/search/?d=201&p=1&q=latin1_swedish_ci
[3 May 2018 8:34] MySQL Verification Team
Hello Jaime,

Thank you for the report!

Regards,
Umesh
[19 May 2018 13:31] Paul DuBois
Posted by developer:
 
Section updated.
[12 Dec 2018 15:22] Marcos Albe
The same applies to 5.7; https://dev.mysql.com/doc/refman/5.7/en/charset-applications.html shows:

"For applications that store data using the default MySQL character set and collation (latin1, latin1_swedish_ci), no special configuration should be needed. If applications require data storage using a different character set or collation, you can configure character set information several ways: "

And latin1 is not the default anymore...