


MySQL single table data should not exceed 5 million rows: is it an empirical value or a golden rule?
Today, let’s discuss an interesting topic: How much data does a single MySQL table need to consider before it needs to be divided into databases and tables? Some say 20 million rows, others say 5 million rows. So, what do you think this value is appropriate?
There was once a widely circulated saying in China's Internet technology circle: MySQL's performance will drop significantly if the data volume of a single table exceeds 20 million rows. In fact, this rumor is said to have originated from Baidu. The specific situation is probably like this. When the DBA tested the performance of MySQL, he found that when the size of a single table reached 20 million rows, the performance of SQL operations dropped sharply. Therefore, the conclusion comes from this. Then it was said that Baidu engineers moved to other companies in the industry and brought this information with them, so this saying spread in the industry.
Later, Alibaba's "Java Development Manual" proposed that database and table sharding is only recommended when the number of rows in a single table exceeds 5 million or the capacity of a single table exceeds 2GB. This is supported by Alibaba's golden iron rule. Therefore, when many people design big data storage, they will use this as a standard to perform table operations.
So, what do you think is the appropriate value? Why not 3 million rows, or 8 million rows, but 5 million rows? Maybe you would say that this may be Ali's best actual combat value? So, the question comes again, how is this value evaluated? Wait a moment, please think about it for a moment.
In fact, this value has nothing to do with the actual number of records, but is related to the configuration of MySQL and the hardware of the machine. Because, in order to improve performance, MySQL will load the index of the table into memory. When the InnoDB buffer size is sufficient, it can be fully loaded into memory and there will be no problem with querying. However, when a single-table database reaches an upper limit of a certain magnitude, the memory cannot store its index, causing subsequent SQL queries to generate disk IO, resulting in performance degradation. Of course, this is also related to the design of the specific table structure, and the ultimate problem is memory limitation. Here, increasing the hardware configuration may bring immediate performance improvements.
So, my point of view on sub-database and sub-table is that it needs to be combined with actual needs and should not be over-designed. The sub-database and sub-table design should not be used at the beginning of the project. Instead, as the business grows, it will be unavailable. If optimization continues, consider sharding databases and tables to improve system performance. In this regard, Alibaba's "Java Development Manual" adds: If the data volume is not expected to reach this level in three years, please do not divide the database into tables when creating the table. So, back to the original question, what do you think is an appropriate value? My suggestion is to make a comprehensive evaluation based on the situation of your own machine. If you have no standard in mind, then temporarily use 5 million lines as a unified standard, which is relatively a compromise value.
For more MySQL related technical articles, please visit the MySQL Tutorial column to learn!
The above is the detailed content of MySQL single table data should not exceed 5 million rows: is it an empirical value or a golden rule?. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



MySQL is suitable for beginners because it is simple to install, powerful and easy to manage data. 1. Simple installation and configuration, suitable for a variety of operating systems. 2. Support basic operations such as creating databases and tables, inserting, querying, updating and deleting data. 3. Provide advanced functions such as JOIN operations and subqueries. 4. Performance can be improved through indexing, query optimization and table partitioning. 5. Support backup, recovery and security measures to ensure data security and consistency.

MySQL is an open source relational database management system. 1) Create database and tables: Use the CREATEDATABASE and CREATETABLE commands. 2) Basic operations: INSERT, UPDATE, DELETE and SELECT. 3) Advanced operations: JOIN, subquery and transaction processing. 4) Debugging skills: Check syntax, data type and permissions. 5) Optimization suggestions: Use indexes, avoid SELECT* and use transactions.

You can open phpMyAdmin through the following steps: 1. Log in to the website control panel; 2. Find and click the phpMyAdmin icon; 3. Enter MySQL credentials; 4. Click "Login".

Create a database using Navicat Premium: Connect to the database server and enter the connection parameters. Right-click on the server and select Create Database. Enter the name of the new database and the specified character set and collation. Connect to the new database and create the table in the Object Browser. Right-click on the table and select Insert Data to insert the data.

MySQL and SQL are essential skills for developers. 1.MySQL is an open source relational database management system, and SQL is the standard language used to manage and operate databases. 2.MySQL supports multiple storage engines through efficient data storage and retrieval functions, and SQL completes complex data operations through simple statements. 3. Examples of usage include basic queries and advanced queries, such as filtering and sorting by condition. 4. Common errors include syntax errors and performance issues, which can be optimized by checking SQL statements and using EXPLAIN commands. 5. Performance optimization techniques include using indexes, avoiding full table scanning, optimizing JOIN operations and improving code readability.

You can create a new MySQL connection in Navicat by following the steps: Open the application and select New Connection (Ctrl N). Select "MySQL" as the connection type. Enter the hostname/IP address, port, username, and password. (Optional) Configure advanced options. Save the connection and enter the connection name.

Recovering deleted rows directly from the database is usually impossible unless there is a backup or transaction rollback mechanism. Key point: Transaction rollback: Execute ROLLBACK before the transaction is committed to recover data. Backup: Regular backup of the database can be used to quickly restore data. Database snapshot: You can create a read-only copy of the database and restore the data after the data is deleted accidentally. Use DELETE statement with caution: Check the conditions carefully to avoid accidentally deleting data. Use the WHERE clause: explicitly specify the data to be deleted. Use the test environment: Test before performing a DELETE operation.

Redis uses a single threaded architecture to provide high performance, simplicity, and consistency. It utilizes I/O multiplexing, event loops, non-blocking I/O, and shared memory to improve concurrency, but with limitations of concurrency limitations, single point of failure, and unsuitable for write-intensive workloads.
