There are 29,938,766 rows in my VISITS table, which looks like this:

| USER_ID (INT) | VISITED_IN (DATETIME) |
|---|---|
| 65 | 2020-08-26 07:57:43 |
| 1182 | 2019-03-15 02:46:48 |
| 1564 | 2015-07-04 10:59:44 |
| 73 | 2021-03-18 00:25:08 |
| 3791 | 2017-10-17 12:22:45 |
| 51 | 2022-05-02 19:11:09 |
| 917 | 2017-11-20 15:32:06 |
| 3 | 2019-12-29 15:15:51 |
| 51 | 2015-02-08 17:48:30 |
| 1531 | 2020-08-05 08:44:55 |
| etc... | etc... |
When running this query, it takes 17-20 seconds and returns 63,514 (the user has 63,514 visits):

```sql
SELECT COUNT(*) FROM VISITS WHERE USER_ID = 917;
```

When running this query, it takes 17-20 seconds and returns 193 (the user has 193 visits):

```sql
SELECT COUNT(*) FROM VISITS WHERE USER_ID = 716;
```
The problem is that counting over these 29,938,766 rows always takes 17-20 seconds, whether the user has 3, 50, 70, or 1,000,000 visits.

I think this is because the query is looping through all the rows. I expected the query for the user with fewer visits to be faster, since the time should depend on the number of matching rows, but both queries take the same time!

Do you have any suggestions for avoiding this problem?
Table Structure
Update: Here is a new suggested scenario:

When a user opens his own or someone else's profile, he can see the number of profile visits and can filter the visits like this:

- Last 24 hours: `SELECT COUNT(*) FROM VISITS WHERE USER_ID = 5 AND VISITED_IN >= DATE_SUB(NOW(), INTERVAL 1 DAY);`
- Last 7 days: `SELECT COUNT(*) FROM VISITS WHERE USER_ID = 5 AND VISITED_IN >= DATE_SUB(NOW(), INTERVAL 7 DAY);`
- Last 30 days: `SELECT COUNT(*) FROM VISITS WHERE USER_ID = 5 AND VISITED_IN >= DATE_SUB(NOW(), INTERVAL 30 DAY);`
- All time: `SELECT VISITS FROM USERS WHERE USER_ID = 5;`
Additionally, I will create a recurring event that executes this command every day:

```sql
DELETE FROM VISITS WHERE VISITED_IN <= DATE_SUB(NOW(), INTERVAL 30 DAY);
```

Also, when adding a new row to the VISITS table, I will make sure to increment the VISITS column:

```sql
UPDATE USERS SET VISITS = VISITS + 1 WHERE ID = 5;
```
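(A sketch of doing both steps atomically, so the counter cannot drift from the actual rows if one statement fails; table and column names are taken from the question, and user id 5 is just an example:)

```sql
START TRANSACTION;
  -- Record the visit itself
  INSERT INTO VISITS (USER_ID, VISITED_IN) VALUES (5, NOW());
  -- Keep the all-time counter in USERS in sync with the new row
  UPDATE USERS SET VISITS = VISITS + 1 WHERE ID = 5;
COMMIT;
```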
A composite `INDEX(USER_ID, VISITED_IN)` will speed up all the `SELECT`s you mentioned. They will only have to scan a chunk of that index; they won't have to "scan the entire table".

The `DELETE` requires `INDEX(VISITED_IN)`. But if you don't run it often enough, problems can arise, because deleting thousands of rows at once can itself be a problem; consider running the delete at least once every hour. If the table is very large, consider "time series" partitioning, since `DROP PARTITION` is much faster than a big `DELETE`.

Any caching service will return a slightly stale count, but it may be faster.
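A rough sketch of both suggestions, using the table and column names from the question (the index names are made up, the partition boundaries are illustrative, and in practice a script would add a new partition each day and drop the oldest):

```sql
-- Composite index so a per-user count scans only that user's slice of the index
ALTER TABLE VISITS ADD INDEX user_time_idx (USER_ID, VISITED_IN);

-- Index to make the daily purge efficient
ALTER TABLE VISITS ADD INDEX time_idx (VISITED_IN);

-- Alternatively, "time series" partitioning by day; dropping a whole partition
-- is far cheaper than DELETEing its rows one by one.
-- (Requires that any PRIMARY/UNIQUE key on the table include VISITED_IN.)
ALTER TABLE VISITS PARTITION BY RANGE (TO_DAYS(VISITED_IN)) (
  PARTITION p20240101 VALUES LESS THAN (TO_DAYS('2024-01-02')),
  PARTITION p20240102 VALUES LESS THAN (TO_DAYS('2024-01-03')),
  PARTITION pfuture   VALUES LESS THAN (MAXVALUE)
);

-- The daily purge then becomes:
ALTER TABLE VISITS DROP PARTITION p20240101;
```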
"The database can be hit every time someone opens the page", but only if the query is efficient enough, so do add the index.
In my answer to your other question, I explained how summary tables can speed things up even more. However, that assumes "last N days" is measured from midnight to midnight. Your current queries use `NOW() - INTERVAL N DAY`, which is messier to implement than midnight boundaries. Would you be willing to change the meaning of "last N days"?

(Some INDEX basics...)
An important purpose of any index is to be able to quickly find rows based on certain column(s).

An `INDEX` is a list of keys mapped to rows. A `UNIQUE INDEX` is an `INDEX` plus a uniqueness constraint, meaning no two rows in the index have the same value. A `PRIMARY KEY` is a `UNIQUE INDEX` designated to uniquely identify each row of the table. "Key" and "index" are synonyms.
Indexes (in MySQL's InnoDB engine) are implemented as BTrees (actually B+Trees; see Wikipedia). In the case of the PK, the rest of the row's columns sit in the leaf nodes with the PK value. For "secondary" keys, the "value" part of the BTree entry is the PK column(s).
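(As a sketch of how to verify this on your table: assuming a secondary index such as `INDEX(USER_ID, VISITED_IN)` exists, `EXPLAIN` should show the count being answered from the index alone, without touching the table rows:)

```sql
EXPLAIN SELECT COUNT(*) FROM VISITS WHERE USER_ID = 917;
-- With such an index present, the plan should show type=ref, key=<that index>,
-- and "Using index" in the Extra column (a covering read), meaning only the
-- index entries for user 917 are scanned, not all ~30M rows.
```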
Any index can contain one or more columns; a multi-column index is called "composite". `INDEX(lastname)` is unlikely to be unique. `INDEX(lastname, firstname)` is still unlikely to be unique, but it is "composite".
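(To make that concrete, a sketch with a hypothetical `people` table; the table and index names are made up for illustration:)

```sql
-- Hypothetical table illustrating a composite (multi-column) secondary index
CREATE TABLE people (
  id INT AUTO_INCREMENT PRIMARY KEY,   -- the clustered PK (InnoDB)
  lastname VARCHAR(50) NOT NULL,
  firstname VARCHAR(50) NOT NULL,
  INDEX name_idx (lastname, firstname) -- composite secondary index
);

-- This lookup can be satisfied by scanning only the name_idx BTree;
-- the "value" it yields for each match is the PK (id).
SELECT id FROM people WHERE lastname = 'Smith' AND firstname = 'Ann';
```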