Bug #116826 | default-character-set fails to set the character set in mysqldump correctly | ||
---|---|---|---|
Submitted: | 29 Nov 8:02 | Modified: | 29 Nov 13:40 |
Reporter: | zengyu ma | Email Updates: | |
Status: | Not a Bug | Impact on me: | |
Category: | MySQL Server: mysqldump Command-line Client | Severity: | S3 (Non-critical) |
Version: | mysqldump Ver 8.0.40 for Linux on x86_6 | OS: | CentOS (CentOS Linux release 7.9.2009 (Core)) |
Assigned to: | | CPU Architecture: | x86 |
[29 Nov 8:02]
zengyu ma
[29 Nov 10:45]
MySQL Verification Team
Hi Mr. Zengyu, thank you for your bug report. However, this is not a bug. The latin1 character set is a single-byte character set and cannot accommodate Chinese ideograms, which require a character set that can hold up to 3 (three) bytes per character. That is part of the Unicode Standard, and MySQL follows that standard strictly. Not a bug.
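The byte-width point in the reply above can be checked outside MySQL. The following is a minimal sketch, not part of the original thread. The helper name `fits_in_latin1` is hypothetical, and Python's `latin-1` codec (ISO-8859-1) is used as a stand-in for MySQL's `latin1`, which is closer to cp1252; the difference should not matter for this character.

```python
# Illustrative only: compare how one Chinese ideogram encodes in UTF-8 and latin1.
ch = "中"  # U+4E2D, the character from the reporter's INSERT statement

utf8_bytes = ch.encode("utf-8")
print(len(utf8_bytes), utf8_bytes.hex())  # 3 e4b8ad


def fits_in_latin1(text: str) -> bool:
    """Return True if every character has a single-byte latin1 encoding."""
    try:
        text.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False


print(fits_in_latin1("a"))  # True
print(fits_in_latin1(ch))   # False
```

The ideogram needs three bytes in UTF-8 and has no single-byte latin1 code point, which is consistent with the explanation given in the reply.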
[29 Nov 13:40]
zengyu ma
Thank you for the explanation, but I am still confused: the comments in the table structure are not displayed properly, while the data in the table is displayed correctly.

[mysql@localhost ~]$ mysqldump --login-path=root --quick --extended-insert --default-character-set=latin1 --hex-blob --master-data=2 --single-transaction --set-gtid-purged=OFF -B test --tables la>data.sql
WARNING: --master-data is deprecated and will be removed in a future version. Use --source-data instead.
[mysql@localhost ~]$ cat data.sql
-- MySQL dump 10.13 Distrib 8.0.40, for Linux (x86_64)
--
-- Host: localhost Database: test
-- ------------------------------------------------------
-- Server version 8.0.40

/*!40101 SET @OLD_CHARACTER_SET_CLIENT=@@CHARACTER_SET_CLIENT */;
/*!40101 SET @OLD_CHARACTER_SET_RESULTS=@@CHARACTER_SET_RESULTS */;
/*!40101 SET @OLD_COLLATION_CONNECTION=@@COLLATION_CONNECTION */;
/*!50503 SET NAMES latin1 */;
/*!40103 SET @OLD_TIME_ZONE=@@TIME_ZONE */;
/*!40103 SET TIME_ZONE='+00:00' */;
/*!40014 SET @OLD_UNIQUE_CHECKS=@@UNIQUE_CHECKS, UNIQUE_CHECKS=0 */;
/*!40014 SET @OLD_FOREIGN_KEY_CHECKS=@@FOREIGN_KEY_CHECKS, FOREIGN_KEY_CHECKS=0 */;
/*!40101 SET @OLD_SQL_MODE=@@SQL_MODE, SQL_MODE='NO_AUTO_VALUE_ON_ZERO' */;
/*!40111 SET @OLD_SQL_NOTES=@@SQL_NOTES, SQL_NOTES=0 */;

--
-- Position to start replication or point-in-time recovery from
--

-- CHANGE MASTER TO MASTER_LOG_FILE='mysql-bin.000003', MASTER_LOG_POS=1341;

--
-- Table structure for table `la`
--

DROP TABLE IF EXISTS `la`;
/*!40101 SET @saved_cs_client = @@character_set_client */;
/*!50503 SET character_set_client = utf8mb4 */;
CREATE TABLE `la` (
`id` int NOT NULL AUTO_INCREMENT COMMENT '主键',
`name` varchar(5) DEFAULT NULL,
PRIMARY KEY (`id`)
) ENGINE=InnoDB AUTO_INCREMENT=3 DEFAULT CHARSET=latin1;
/*!40101 SET character_set_client = @saved_cs_client */;

--
-- Dumping data for table `la`
--

LOCK TABLES `la` WRITE;
/*!40000 ALTER TABLE `la` DISABLE KEYS */;
INSERT INTO `la` VALUES (1,'a'),(2,'中');
/*!40000 ALTER TABLE `la` ENABLE KEYS */;
UNLOCK TABLES;
/*!40103 SET TIME_ZONE=@OLD_TIME_ZONE */;
/*!40101 SET SQL_MODE=@OLD_SQL_MODE */;
/*!40014 SET FOREIGN_KEY_CHECKS=@OLD_FOREIGN_KEY_CHECKS */;
/*!40014 SET UNIQUE_CHECKS=@OLD_UNIQUE_CHECKS */;
/*!40101 SET CHARACTER_SET_CLIENT=@OLD_CHARACTER_SET_CLIENT */;
/*!40101 SET CHARACTER_SET_RESULTS=@OLD_CHARACTER_SET_RESULTS */;
/*!40101 SET COLLATION_CONNECTION=@OLD_COLLATION_CONNECTION */;
/*!40111 SET SQL_NOTES=@OLD_SQL_NOTES */;

-- Dump completed on 2024-11-29 21:22:27
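One possible reading of why the row data appears intact is sketched below; the thread does not confirm it. It assumes the row was originally written over a latin1 connection by a client that actually sent UTF-8 bytes, so the latin1 column holds three one-byte characters rather than one ideogram. Python's `latin-1` codec again stands in for MySQL's `latin1`, and all variable names are illustrative.

```python
# Hypothetical illustration: UTF-8 bytes carried unchanged through latin1.
original = "中"
stored = original.encode("utf-8")        # bytes the client sent: e4 b8 ad

# A latin1 column treats these as three separate one-byte characters.
as_latin1 = stored.decode("latin-1")
print(len(as_latin1))                    # 3

# A dump taken over a latin1 connection writes the same bytes back out,
# because no conversion is needed between column and connection.
dumped = as_latin1.encode("latin-1")
print(dumped == stored)                  # True

# A terminal that interprets its input as UTF-8 shows one ideogram again.
print(dumped.decode("utf-8"))            # 中
```

Under that assumption the bytes survive the round trip only because nothing converts them on the way; the column itself would still hold three latin1 characters, not a Chinese character.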
[29 Nov 13:47]
MySQL Verification Team
Hi Mr. Zengyu, thank you for your bug report. However, this is not a bug. The latin1 character set is a single-byte character set and cannot accommodate Chinese ideograms, which require a character set that can hold up to 3 (three) bytes per character. That is part of the Unicode Standard, and MySQL follows that standard strictly. Not a bug.