


Comprehensive understanding of the difference between utf8 and utf8mb4 in mysql_MySQL
1. Introduction
MySQL added the utf8mb4 encoding after 5.5.3. mb4 means most bytes 4, which is specially designed to be compatible with four-byte unicode. Fortunately, utf8mb4 is a superset of utf8, and no other conversion is required except changing the encoding to utf8mb4. Of course, in order to save space, it is usually enough to use utf8.
2. Content description
As mentioned above, since utf8 can store most Chinese characters, why should we use utf8mb4? It turns out that the maximum character length of the utf8 encoding supported by mysql is 3 bytes. If a 4-byte wide character is encountered, an exception will be inserted. . The maximum Unicode character that can be encoded by three-byte UTF-8 is 0xffff, which is the Basic Multilingual Plane (BMP) in Unicode. In other words, any Unicode characters that are not in the basic multi-text plane cannot be stored using Mysql's utf8 character set. Including Emoji expressions (Emoji is a special Unicode encoding, commonly found on ios and android phones), many uncommon Chinese characters, as well as any new Unicode characters, etc.
3. Root of the problem
The original UTF-8 format uses one to six bytes and can encode up to 31 characters. The latest UTF-8 specification uses only one to four bytes and can encode up to 21 bits, which is just enough to represent all 17 Unicode planes.
utf8 is a character set in Mysql that only supports UTF-8 characters up to three bytes, which is the basic multi-text plane in Unicode.
Why does utf8 in Mysql only support UTF-8 characters with a maximum length of three bytes? I thought about it for a while, maybe because when Mysql first started to be developed, Unicode did not have an auxiliary plane. At that time, the Unicode Committee was still dreaming that "65535 characters are enough for the whole world." The string length in Mysql is calculated as the number of characters rather than the number of bytes. For the CHAR data type, sufficient length needs to be reserved for the string. When using the utf8 character set, the length that needs to be reserved is the longest character length of utf8 multiplied by the string length, so of course the maximum length of utf8 is limited to 3. For example, CHAR(100) Mysql will reserve 300 bytes. As for why subsequent versions do not provide support for 4-byte length UTF-8 characters, I think one is for backward compatibility considerations, and the other is that characters outside the basic multilingual plane are rarely used.
To save 4-byte length UTF-8 characters in Mysql, you need to use the utf8mb4 character set, but it is only supported after version 5.5.3 (check the version: select version();). I think that in order to obtain better compatibility, you should always use utf8mb4 instead of utf8. For CHAR type data, utf8mb4 will consume more space. According to Mysql official recommendations, use VARCHAR instead of CHAR.
The above article provides a comprehensive understanding of the difference between utf8 and utf8mb4 in mysql. This is all the content shared by the editor. I hope it can give you a reference and I hope you will support me a lot.

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



You can open phpMyAdmin through the following steps: 1. Log in to the website control panel; 2. Find and click the phpMyAdmin icon; 3. Enter MySQL credentials; 4. Click "Login".

MySQL is an open source relational database management system, mainly used to store and retrieve data quickly and reliably. Its working principle includes client requests, query resolution, execution of queries and return results. Examples of usage include creating tables, inserting and querying data, and advanced features such as JOIN operations. Common errors involve SQL syntax, data types, and permissions, and optimization suggestions include the use of indexes, optimized queries, and partitioning of tables.

Redis uses a single threaded architecture to provide high performance, simplicity, and consistency. It utilizes I/O multiplexing, event loops, non-blocking I/O, and shared memory to improve concurrency, but with limitations of concurrency limitations, single point of failure, and unsuitable for write-intensive workloads.

MySQL is chosen for its performance, reliability, ease of use, and community support. 1.MySQL provides efficient data storage and retrieval functions, supporting multiple data types and advanced query operations. 2. Adopt client-server architecture and multiple storage engines to support transaction and query optimization. 3. Easy to use, supports a variety of operating systems and programming languages. 4. Have strong community support and provide rich resources and solutions.

MySQL's position in databases and programming is very important. It is an open source relational database management system that is widely used in various application scenarios. 1) MySQL provides efficient data storage, organization and retrieval functions, supporting Web, mobile and enterprise-level systems. 2) It uses a client-server architecture, supports multiple storage engines and index optimization. 3) Basic usages include creating tables and inserting data, and advanced usages involve multi-table JOINs and complex queries. 4) Frequently asked questions such as SQL syntax errors and performance issues can be debugged through the EXPLAIN command and slow query log. 5) Performance optimization methods include rational use of indexes, optimized query and use of caches. Best practices include using transactions and PreparedStatemen

Effective monitoring of Redis databases is critical to maintaining optimal performance, identifying potential bottlenecks, and ensuring overall system reliability. Redis Exporter Service is a powerful utility designed to monitor Redis databases using Prometheus. This tutorial will guide you through the complete setup and configuration of Redis Exporter Service, ensuring you seamlessly build monitoring solutions. By studying this tutorial, you will achieve fully operational monitoring settings

The methods for viewing SQL database errors are: 1. View error messages directly; 2. Use SHOW ERRORS and SHOW WARNINGS commands; 3. Access the error log; 4. Use error codes to find the cause of the error; 5. Check the database connection and query syntax; 6. Use debugging tools.

Apache connects to a database requires the following steps: Install the database driver. Configure the web.xml file to create a connection pool. Create a JDBC data source and specify the connection settings. Use the JDBC API to access the database from Java code, including getting connections, creating statements, binding parameters, executing queries or updates, and processing results.
