How to improve the speed of database query for millions of items

巴扎黑
Release: 2017-05-22 14:05:42

1. To optimize a query, avoid full table scans wherever possible. Start by considering indexes on the columns used in the where and order by clauses.
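
As a rough illustration of the point above, the sketch below asks an engine for its query plan before and after an index is created. It uses Python's built-in sqlite3 module only as a convenient stand-in, so the plan wording differs from what MySQL or SQL Server would print; the table t, the index name idx_t_num and the helper plan() are assumptions made for this example.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, num INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO t (num, name) VALUES (?, ?)",
    [(i % 1000, f"name{i}") for i in range(10_000)],
)


def plan(sql):
    """Return the plan description SQLite reports for a statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)


query = "SELECT id FROM t WHERE num = 10"

plan_before = plan(query)  # no index on num yet, so the table is scanned
conn.execute("CREATE INDEX idx_t_num ON t (num)")
plan_after = plan(query)   # the new index can now be used for the lookup

print("before:", plan_before)
print("after: ", plan_after)
```

On MySQL the equivalent check would be to run explain on the query and compare the output before and after adding the index.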

2. Try to avoid testing columns for null in the where clause. Depending on the engine and version, this can make the optimizer give up on the index and perform a full table scan. For example:

select id from t where num is null

Instead, give num a default value of 0, make sure the num column contains no null values, and then query like this:

select id from t where num=0

3. Try to avoid the != and <> operators in the where clause; they can make the engine give up on the index and perform a full table scan.

4. Try to avoid joining conditions with or in the where clause; on some engines this makes the optimizer give up on the index and perform a full table scan. For example:

select id from t where num=10 or num=20

You can query like this: 

select id from t where num=10
union all
select id from t where num=20
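
The rewrite above only returns the same rows when the two branches cannot match the same row. A minimal sketch of that check follows, using Python's sqlite3 module as a stand-in engine; the sample data and variable names are assumptions made for illustration.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, num INTEGER)")
conn.executemany("INSERT INTO t (num) VALUES (?)", [(i % 50,) for i in range(1000)])
conn.execute("CREATE INDEX idx_t_num ON t (num)")


def ids(sql):
    """Run a query and return the selected ids in sorted order."""
    return sorted(row[0] for row in conn.execute(sql))


or_ids = ids("SELECT id FROM t WHERE num = 10 OR num = 20")
union_ids = ids(
    "SELECT id FROM t WHERE num = 10 "
    "UNION ALL "
    "SELECT id FROM t WHERE num = 20"
)

# Overlapping branches: rows with num = 10 match both sides and come back twice.
overlap_ids = ids(
    "SELECT id FROM t WHERE num = 10 "
    "UNION ALL "
    "SELECT id FROM t WHERE num IN (10, 20)"
)

print(len(or_ids), len(union_ids), len(overlap_ids))
```

When the branches can overlap, union (without all) removes the duplicates, at the cost of an extra de-duplication step.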

5. Use in and not in with caution as well, since they can lead to a full table scan. For example:

select id from t where num in(1,2,3)

For continuous values, if you can use between, do not use in:

select id from t where num between 1 and 3

6. The following query will also result in a full table scan:  

select id from t where name like '%abc%'

To improve efficiency, you can consider full-text search.

7. Using local variables in the where clause can also cause a full table scan. SQL Server resolves local variables only at run time, but the optimizer cannot defer choosing an access plan until then; it must choose at compile time. At compile time the value of the variable is still unknown, so it cannot be used as input for index selection. For example, the following statement may perform a full table scan:

select id from t where num=@num
Copy after login

You can change it to force the query to use the index: 

select id from t with(index(index_name)) where num=@num

8. Try to avoid applying expressions to columns in the where clause; doing so makes the engine give up on the index and perform a full table scan. For example:

select id from t where num/2=100

It should be changed to:

select id from t where num=100*2
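
The effect of moving the arithmetic off the column can be sketched with Python's sqlite3 module, used here only as a stand-in engine whose plan wording differs from MySQL or SQL Server. The table contents, the index name idx_t_num and the helper plan() are assumptions for this example.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, num INTEGER)")
conn.executemany("INSERT INTO t (num) VALUES (?)", [(i,) for i in range(1000)])
conn.execute("CREATE INDEX idx_t_num ON t (num)")


def plan(sql):
    """Return the plan description SQLite reports for a statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)


# Expression on the column: every row has to be evaluated.
plan_expr = plan("SELECT id FROM t WHERE num / 2 = 100")
# Bare column compared with a constant: the index can be searched.
plan_plain = plan("SELECT id FROM t WHERE num = 100 * 2")

# Compare the rows as well, because integer division truncates.
nums_expr = [r[0] for r in conn.execute("SELECT num FROM t WHERE num / 2 = 100 ORDER BY num")]
nums_plain = [r[0] for r in conn.execute("SELECT num FROM t WHERE num = 100 * 2 ORDER BY num")]

print(plan_expr, nums_expr)
print(plan_plain, nums_plain)
```

Note that the two predicates are not always equivalent: where dividing two integers truncates (as in SQLite here, and in SQL Server for integer columns), num/2=100 also matches 201, so a faithful rewrite would be a range such as num>=200 and num<202.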

9. Try to avoid applying functions to columns in the where clause; doing so makes the engine give up on the index and perform a full table scan. For example:

select id from t where substring(name,1,3)='abc' -- ids whose name starts with abc
select id from t where datediff(day,createdate,'2005-11-30')=0 -- ids created on 2005-11-30

They should be changed to:

select id from t where name like 'abc%'
select id from t where createdate>='2005-11-30' and createdate<'2005-12-1'
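
The date rewrite above can be sketched in Python with sqlite3 as a stand-in engine. SQLite has no datediff, so its date() function plays the role of the function wrapped around the column; the table layout, sample timestamps and helper names are assumptions for this example.

```python
import sqlite3
from datetime import datetime, timedelta

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, createdate TEXT)")

# Twelve timestamps, six hours apart, covering 2005-11-29 to 2005-12-01.
start = datetime(2005, 11, 29)
stamps = [(start + timedelta(hours=6 * i)).isoformat(sep=" ") for i in range(12)]
conn.executemany("INSERT INTO t (createdate) VALUES (?)", [(s,) for s in stamps])
conn.execute("CREATE INDEX idx_t_createdate ON t (createdate)")


def plan(sql):
    """Return the plan description SQLite reports for a statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)


by_function = "SELECT id FROM t WHERE date(createdate) = '2005-11-30'"
by_range = (
    "SELECT id FROM t "
    "WHERE createdate >= '2005-11-30' AND createdate < '2005-12-01'"
)

plan_function = plan(by_function)  # function on the column: rows are scanned
plan_range = plan(by_range)        # bare column: the index range is searched

ids_function = sorted(r[0] for r in conn.execute(by_function))
ids_range = sorted(r[0] for r in conn.execute(by_range))

print(plan_function)
print(plan_range)
print(ids_function == ids_range, ids_range)
```

Both queries return the same four rows in this sample; only the range form leaves the indexed column bare so that the index can be searched.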

10. Do not perform functions, arithmetic operations or other expression operations on the left side of "=" in the where clause, otherwise the system may not be able to use the index correctly.

11. When filtering on a column that belongs to a composite index, the condition must include the first column of that index for the system to use it; otherwise the index will not be used. As far as possible, keep the order of the columns in the condition consistent with the order of the columns in the index.
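
A minimal sketch of the leading-column rule, again using Python's sqlite3 module as a stand-in engine. The orders table, its columns and the index name are hypothetical, and other engines word their plans differently.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, status TEXT)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, status) VALUES (?, ?)",
    [(i % 100, "paid" if i % 3 else "open") for i in range(3000)],
)
# Composite index: customer_id is the leading column, status the second.
conn.execute("CREATE INDEX idx_orders_customer_status ON orders (customer_id, status)")


def plan(sql):
    """Return the plan description SQLite reports for a statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)


# Leading column only: the index can be searched.
plan_leading = plan("SELECT id FROM orders WHERE customer_id = 7")
# Both columns: the index can be searched on both.
plan_both = plan("SELECT id FROM orders WHERE customer_id = 7 AND status = 'paid'")
# Second column only: no search on the index is possible, rows are scanned.
plan_trailing = plan("SELECT id FROM orders WHERE status = 'paid'")

print(plan_leading)
print(plan_both)
print(plan_trailing)
```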

12. Do not write meaningless queries. For example, if you need to generate an empty table structure:  

select col1,col2 into #t from t where 1=0

This type of code will not return any result set, but will consume system resources. It should be changed to this:

create table #t(...)

13. It is often a good choice to use exists instead of in:

select num from a where num in(select num from b)

Replace with the following statement:

select num from a where exists(select 1 from b where num=a.num)
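
Before swapping one form for the other, it is worth confirming that both return the same rows. The sketch below does that with Python's sqlite3 module as a stand-in engine; the contents of tables a and b are made up for illustration, and the column in the subquery is qualified as b.num for readability.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a (num INTEGER)")
conn.execute("CREATE TABLE b (num INTEGER)")
conn.executemany("INSERT INTO a (num) VALUES (?)", [(n,) for n in range(1, 11)])
# b holds the even numbers 2..20, each stored twice.
conn.executemany("INSERT INTO b (num) VALUES (?)", [(n,) for n in range(2, 21, 2)] * 2)

in_rows = sorted(
    r[0] for r in conn.execute("SELECT num FROM a WHERE num IN (SELECT num FROM b)")
)
exists_rows = sorted(
    r[0]
    for r in conn.execute(
        "SELECT num FROM a WHERE EXISTS (SELECT 1 FROM b WHERE b.num = a.num)"
    )
)

print(in_rows)
print(exists_rows)
```

Which form is faster depends on the engine and version, so it is worth comparing the two plans on the target database rather than assuming.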

14. Not every index helps a query. The optimizer chooses a plan based on the data in the table, and when an indexed column contains a large amount of duplicate data the query may not use the index at all. For example, if a table has a sex column whose values are split almost evenly between male and female, an index on sex does nothing for query efficiency.
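
One rough way to spot such columns is to compare the number of distinct values with the number of rows. The sketch below computes that ratio with Python's sqlite3 module; the term selectivity, the person table and the helper function are assumptions introduced for this example, not part of the original tips.

```python
import sqlite3

# In-memory SQLite database used as a stand-in engine (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, sex TEXT)")
conn.executemany(
    "INSERT INTO person (sex) VALUES (?)",
    [("male" if i % 2 else "female",) for i in range(1000)],
)


def selectivity(column):
    """Distinct values divided by row count; close to 0 means many duplicates."""
    # The column name is interpolated directly, so only pass trusted names.
    sql = f"SELECT COUNT(DISTINCT {column}) * 1.0 / COUNT(*) FROM person"
    return conn.execute(sql).fetchone()[0]


sex_selectivity = selectivity("sex")  # 2 distinct values over 1000 rows
id_selectivity = selectivity("id")    # every value is unique

print(sex_selectivity, id_selectivity)
```

A ratio near 1 suggests an index can narrow a search to a few rows, while a ratio near 0, as for sex here, suggests each indexed value still matches a large share of the table.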

15. More indexes are not always better. An index can improve the efficiency of the corresponding select, but it also reduces the efficiency of insert and update, because the index may have to be rebuilt during an insert or update. How to index a table therefore needs careful, case-by-case consideration. It is best not to have more than 6 indexes on a table; if there are more, consider whether the indexes on rarely used columns are really necessary.

16. You should avoid updating clustered index data columns as much as possible, because the order of clustered index data columns is the physical storage order of table records. Once the column value changes, the order of the entire table records will be adjusted, which will consume considerable resources. If the application system needs to frequently update clustered index data columns, then you need to consider whether the index should be built as a clustered index.

17. Prefer numeric fields. If a field holds only numeric data, try not to define it as a character field, since that reduces the performance of queries and joins and increases storage overhead. When processing queries and joins, the engine compares a string character by character, whereas a numeric type needs only a single comparison.

18. Prefer varchar/nvarchar to char/nchar where possible. First, variable-length fields take less storage space. Second, searching within a smaller field is more efficient.

19. Do not use select * from t anywhere, replace "*" with a specific field list, and do not return any unused fields.

20. Try to use table variables instead of temporary tables. If the table variable contains a large amount of data, be aware that the indexes are very limited (only primary key indexes).

21. Avoid frequently creating and deleting temporary tables to reduce the consumption of system table resources.

22. Temporary tables are not off limits, and using them appropriately can make certain routines more efficient, for example when you need to repeatedly reference a data set from a large table or a commonly used table. For one-off operations, however, a derived table is the better choice.

23. When creating a temporary table, if a large amount of data is inserted at once, use select into instead of create table to avoid heavy logging and improve speed; if the amount of data is small, first create table and then insert, to reduce the load on system tables.

24. If temporary tables are used, all temporary tables must be explicitly deleted at the end of the stored procedure, first truncate table, and then drop table. This can avoid long-term locking of system tables.

25. Try to avoid using cursors because cursors are less efficient. If the data operated by the cursor exceeds 10,000 rows, you should consider rewriting it.

26. Before using the cursor-based method or the temporary table method, you should first look for a set-based solution to solve the problem. The set-based method is usually more effective.
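
The difference between the two approaches can be sketched in Python with sqlite3 as a stand-in engine. The loop below plays the role of a cursor by issuing one update per row, while the set-based version is a single statement; the account table and the helper functions are hypothetical.

```python
import sqlite3


def make_db():
    """Create a small in-memory table of account balances (sample data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER)")
    conn.executemany(
        "INSERT INTO account (balance) VALUES (?)", [(i * 100,) for i in range(5)]
    )
    return conn


def snapshot(conn):
    """Return all rows ordered by id so two databases can be compared."""
    return conn.execute("SELECT id, balance FROM account ORDER BY id").fetchall()


# Row-by-row, cursor style: fetch every row, then issue one UPDATE per row.
row_db = make_db()
for account_id, balance in row_db.execute("SELECT id, balance FROM account").fetchall():
    row_db.execute(
        "UPDATE account SET balance = ? WHERE id = ?", (balance + 10, account_id)
    )

# Set-based: a single statement expresses the same change for all rows.
set_db = make_db()
set_db.execute("UPDATE account SET balance = balance + 10")

row_by_row = snapshot(row_db)
set_based = snapshot(set_db)
print(row_by_row == set_based, set_based)
```

Both versions end with the same data, but the set-based one sends a single statement and leaves the engine free to choose how to apply it.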

27. Like temporary tables, cursors are not off limits. Using FAST_FORWARD cursors on small data sets is often better than other row-by-row processing methods, especially when several tables must be referenced to obtain the required data. Routines that include "totals" in the result set are usually faster than their cursor-based equivalents. If development time permits, try both the cursor-based method and the set-based method and see which works better.

28. Put SET NOCOUNT ON at the beginning of every stored procedure and trigger and SET NOCOUNT OFF at the end, so that the server does not have to send a DONE_IN_PROC message to the client after each statement.

29. Try to avoid large transactions, in order to improve the system's concurrency.

30. Try to avoid returning large amounts of data to the client. If the amount of data is too large, you should consider whether the corresponding requirements are reasonable.

source:php.cn