Bug #101556 | CP1252 is not Latin-1 | ||
---|---|---|---|
Submitted: | 11 Nov 2020 8:53 | Modified: | 11 Nov 2020 13:18 |
Reporter: | Marc Masó | Email Updates: | |
Status: | Verified | Impact on me: | |
Category: | MySQL Server: Charsets | Severity: | S4 (Feature request) |
Version: | OS: | Any | |
Assigned to: | CPU Architecture: | Any | |
Tags: | CP1252, encoding, Latin1, tildes |
[11 Nov 2020 8:53]
Marc Masó
[11 Nov 2020 12:45]
MySQL Verification Team
Hi Mr. maso, Thank you for your bug report. However, we can not change the existing names of the character sets, as this would ruin millions of applications. We also can not change definition of the character sets, for the same reason. So many millions of applications are depending on the charsets / collations remaining as they are. Also, we do not understand what does it mean "Latin-1 is not Latin-1". If you wish to add ISO-8859-1 character set, please provide the justification and the exact definition. In that case, it could be considered as a feature request. We are waiting on your feedback.
[11 Nov 2020 12:57]
Marc Masó
Sorry, a typo. What I meant to say is: "Latin-1 is not equal to CP1252, also know as ISO-8859-1." The big problem is in the range 0x80-0x9F. ISO-8859-1 (latin-1) encodes control codes but CP1252 encodes tildes and other important characters in languages like catalan or french. It would be really good to add this encoding. here is a more detail comparison between ISO-8859-1 and CP-1252: https://www.i18nqa.com/debug/table-iso8859-1-vs-windows-1252.html
[11 Nov 2020 13:18]
MySQL Verification Team
Hi Mr. maso, We have analysed your request and we think that this is a feature that makes sense to be implemented. Hence, we verify that ISO-8859-1 should be implemented as a separate character set.