MySQL big data query performance optimization tutorial (picture)
MySQL performance optimization includes table optimization and column type selection. What can table optimization be broken down into? 1. Separate fixed-length and variable-length fields; 2. Separate commonly used fields from uncommon fields; 3. Add redundant fields to 1-to-many fields that require correlation statistics.
1. Table optimization and column type selection
Table optimization:
1. Fixed length Separate from the change of length
##al, such as ID int, account for 4 bytes, Char (4) accounts for 4 characters length, and it is also fixed. Time is the byte of each unit value. . Core and commonly used fields should be built to a fixed length and placed in one table. and Varchar, Text, Blob, long fields are suitable for placing a single table and associated with the main key with the core table.2. Commonly used fields and less commonly used fields should be separated
need to be analyzed in conjunction with the specific business of the website, and the query scenarios of the fields should be separated. Take it apart.3. Add redundant fields to the 1-to-many fields that require related statistics.
See the following effect:2. Column type selection
1. Field type priority
Integer type>date time>enum
char>varchar>blob,text Integer type: fixed length, no country/region distinction, no character set difference. For example:
tinyint 1,2,3,4,5 char(1) a,b,c,d,e
In terms of space, they all occupy 1 Bytes, but order by sorting, the former is faster. The reason may be that the character set and collation set (that is, the sorting rules) need to be considered;
The time is fixed length, the operation is fast, and the space is saved. Considering the time zone, it is inconvenient to write sql where > `2018-08-08`;
enum, which can serve the purpose of constraint, is stored internally using integers, but when jointly querying with cahr, the internal Go through the conversion of strings and values;
char fixed length, consider the character set and (sorting) proofreading set;
varchar variable length, need to consider the character set conversion and proofreading set when sorting, Slow speed;
text/blob cannot use memory temporary table (sorting and other operations can only be performed on disk)
Attachment: Regarding the selection of date/time, the master’s clear opinion, choose directly int unsgined not null, stores timestamp.
For example:
Gender: Take utf8 as an example
char(1), 3 bytes long
enum('Male',' Female'); Internally converted into numbers for storage, one more conversion process
tinyint(), fixed length 1 byte
2. Just use enough, don’t be generous (such as smallint varchar(N))
Reason: Large bytes waste memory and affect speed.
Taking age as an example tinyint unsigned not null can store 255 years old, which is enough. Using int wastes 3 bytes;
The content stored in varchar(10) and varchar(300) is the same, but varchar(300) takes more memory during table join query.
3. Try to avoid using NULL()Reason: NULL is not conducive to indexing and must be marked with special characters.
The space occupied on the disk is actually larger (MySQL5.5 has improved null, but the query is still inconvenient)
3. Index optimization strategy
1. Index type1.1 B-tree index
It’s called btree index. From a broad perspective, they all use balanced trees, but in terms of specific implementation, each engine is slightly different. For example, strictly speaking, the NDB engine uses T-tree.
But abstracting the B-tree system, it can be understood as a "sorted fast query structure".
1.2 Hash index
The default is hash index in the memory table, and the theoretical query time complexity of hash is O(1).
Question: Since hash search is so efficient, why not use hash index?
Answer:
1. The result calculated by the hash function is random. If the data is placed on the disk, taking the primary key as id as an example, then as the id grows, the id The corresponding rows are randomly placed on the disk.
2. Range query cannot be optimized.
3. The prefix index cannot be used. For example, in btree, the value of the field column is "helloworld", and the index query x=helloworld can naturally use the index, and x=hello can also use the index (left prefix index) .
4. Sorting cannot be optimized.
5. The row must be returned, which means that to get the data location through the index, the data must be returned to the table.
2. Common misunderstandings of btree indexes
2.1 Add indexes on columns commonly used in where conditions, for example:
where cat_id = 3 and price> ;100; Check the third column for products over 100 yuan.
Misunderstanding: Add indexes to both cat_id and price.
Error: Only cat_id or price index can be used, because they are independent indexes, and only one can be used at the same time.
2.2 After creating an index on multiple columns (joint index), the index will play a role in whichever column is queried
Misunderstanding: For the index to work on a multi-column index, the left prefix requirement needs to be met .
Take index(a,b,c) as an example, (note that it depends on the order)
4. Index experiment
For example: select * from t4 where c1=3 and c2 = 4 and c4>5 and c3=2;
Which indexes are used:
explain select * from t4 Where C1 = 3 and C2 = 4 and C4 & GT; 5 and C3 = 2 \ G
## As follows:## Note: (Key_Len: 4)
5. Clustered index and non-clustered index
Myisam and innodb engine, similarities and differences in index filesMyisam: consists of news.myd and new.myi The two files, the index file and the data file, are separate and are called non-clustered indexes. Both the primary index and the secondary index point to the physical row (the location of the disk)innodb: The index and data are gathered together, so it is a clustered index. The row of data is stored directly in the primary index file of innodb, and the secondary index points to a reference to the primary key index.Note: For innodb:
1. The primary key index stores the index value and stores the row data in the leaves. 2. If there is no primary key, unique key will be used as the primary key. 3. If there is no unique, the system generates an internal rowid as the primary key. 4. Like innodb, in the primary key index structure, both the primary key value and the row data are stored. This structure is called a clustered index.Clustered index
Advantages: When there are relatively few query entries based on the primary key, no rowback is needed (the data is under the primary key node)Disadvantages: If When irregular data is inserted, frequent page splits occurRelated articles:Mysql performance optimization
Related videos:MySQL Optimization Video Tutorial
The above is the detailed content of MySQL big data query performance optimization tutorial (picture). For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

How to optimize MySQL connection speed? Overview: MySQL is a widely used relational database management system that is commonly used for data storage and management in a variety of applications. During development, optimization of MySQL connection speed is critical to improving application performance. This article will introduce some common methods and techniques for optimizing MySQL connection speed. Table of Contents: Use connection pools to adjust connection parameters and optimize network settings. Use indexing and caching to avoid long idle connections. Configure appropriate hardware resources. Summary: Use connection pools.

In the current Internet era, the importance of data is self-evident. As one of the core components of Internet applications, database backup and recovery work is particularly important. However, as the amount of data continues to increase and business requirements become increasingly complex, traditional database backup and recovery solutions can no longer meet the high availability and high performance requirements of modern applications. Therefore, optimizing the backup and recovery performance of MySQL database has become an urgent problem that needs to be solved. In practice, we have adopted a series of project experiences to effectively improve MySQL data

MySQL Performance Optimization Practical Guide: In-depth Understanding of B+ Tree Indexes Introduction: As an open source relational database management system, MySQL is widely used in various fields. However, as the amount of data continues to increase and query requirements become more complex, MySQL's performance problems are becoming more and more prominent. Among them, the design and use of indexes are one of the key factors affecting MySQL performance. This article will introduce the principle of B+ tree index and show how to optimize the performance of MySQL with actual code examples. 1. Principle of B+ tree index B+ tree is a

With the rapid development of the Internet, MySQL database has become the core data storage technology for many websites, applications and even enterprises. However, with the continuous growth of data volume and the sharp increase in concurrent access, MySQL's performance problems have become increasingly prominent. PHP's PDO class is also widely used in the development and operation of MySQL because of its efficient and stable performance. In this article, we will introduce how to use the PDO class to optimize MySQL performance and improve the database's response speed and concurrent access capabilities. 1. Introduction to PDO class

In modern applications, the MySQL database is a common choice. However, as data volumes grow and business needs continue to change, MySQL performance may suffer. In order to maintain the high performance of MySQL database, dynamic SQL statements have become an important technical means to improve the performance of MySQL. What is a dynamic SQL statement? A dynamic SQL statement refers to the technology of generating SQL statements by a program in an application. In layman's terms, it means treating SQL statements as strings. For large applications,

MySQL performance optimization: Master the characteristics and advantages of the TokuDB engine Introduction: In large-scale data processing applications, performance optimization of the MySQL database is a crucial task. MySQL provides a variety of engines, each with different features and advantages. This article will introduce the features and advantages of the TokuDB engine and provide some code examples to help readers better understand and apply the TokuDB engine. 1. Characteristics of TokuDB engine TokuDB is a high-performance, high-compression rate storage engine.

With the rapid development of the Internet, the scale of data continues to expand, and the demand for database storage and query efficiency is also increasing. As the most commonly used open source database, MySQL's performance optimization has always been the focus of developers. This article will introduce an effective MySQL performance optimization technology - vertical partition table, and explain in detail how to implement and apply it. 1. What is a vertical partition table? Vertically partitioned tables refer to dividing a table according to column characteristics and storing different columns on different physical storage devices to improve query efficiency.

As a lightweight relational database management system, MySQL database is widely used in Internet applications and enterprise-level systems. In enterprise-level applications, as the amount of data increases, the size of data tables also continues to increase. Therefore, effective management of data table sizes is crucial to ensuring the performance and reliability of the database. This article will introduce data table size management techniques in MySQL. 1. Data table division As the amount of data continues to increase, the size of the data table also continues to increase, which will cause database performance to decrease and query operations to slow down.
