Solution to generate unique database ID in distributed situation
ID, as the unique identifier of the business, is often seen in data design, for example:
•Product - product_id
•Order - order_id
•Message - message_id
These identifiers are often the primary keys of the database, and MySQL will The primary key is to create a clustered index, which directly points to the data address. Compared with the ordinary index pointing to the clustered index, it reduces one index query and is very fast. Businesses such as messages and orders generally have the need to query data in reverse chronological order. One way is to create an index on the time column, and even better is to rely on the insertion order of the ID itself. Therefore, distributed ID needs to meet two core conditions:
• Globally unique
• Time trend orderly
Some people may say, wouldn’t it be enough to just use MySQL’s auto_increment directly? In the early days of starting a business, I would also choose this solution. It is simple, efficient and fast - startups still have to iterate quickly and produce products as soon as possible, and products change frequently. The awesome architecture that takes too much time to develop may not be useful. Yes, valuable time was wasted. However, there are some problems with this solution:
• Affects parallel insertion - record B depends on the primary key of record A. You need to wait until record A is inserted successfully and get A.id before you can insert record B
• Data recovery is difficult - —After the data is accidentally deleted or lost, since there is no ID in the log, the data correlation cannot be directly determined
• Impact on database and table sharding—Since the ID is not known until it is inserted, database and table sharding cannot be performed based on the primary key of the business
Therefore, after the business is stable, you must take time to pay off early technical debt.
Common solutions
Use the auto_increment of the database to generate a unique ID
Advantages
•Simple, using existing functions, small development effort
•Fixed ID step size
Disadvantages
•Single point of writing, not high Available
• Even if multiple main libraries are expanded according to different auto_increment starting points, although the availability is improved, the strict order of IDs cannot be guaranteed
• The database needs to be accessed every time, and it is easy to reach the performance ceiling
Pulling IDs in batches, Allocate one by one
This solution also stores the ID data in the database. The ID service pulls N IDs from the database each time and updates the current maximum ID value to the original data + N. The ID service receives the ID each time When a request is generated, these N IDs are returned in sequence.
Advantages
•Batch acquisition, no need to access the database every time, low database pressure
Disadvantages
•The entire service is still a single point
•Service downtime and restart will cause ID discontinuity
•Cannot be horizontally expanded
Improvements
Add a set of backup services. If the main service fails and drifts to the backup service, you can use vip + keepalived or add a proxy.
uuid
Advantages
•Locally generated ID, no single point problem, no performance bottleneck
Disadvantages
•Cannot guarantee incremental order
•Length is too long, low performance as a primary key
Snowflake-like algorithm
Snowflake is Twitter's open source distributed ID generation algorithm. Its core idea is: a long ID, using 41 bits as the number of milliseconds, 10 bits as the machine number, and 12 bits as the sequence number within the millisecond. This algorithm can theoretically generate up to 1000*(2^12), or 400W IDs per second on a single machine, which can fully meet business needs.
Learning from snowflake’s ideas and combining the business logic and concurrency of each company, you can implement your own distributed ID generation algorithm.
Advantages
•Time is at a high level, the trend is increasing
•Simple to implement, does not rely on other services, easy to expand
Disadvantages
•There is no global clock, a single machine is absolutely in order, but from the perspective of the entire cluster, the trend is Sequential
Notes
•Since ID is often used as the identifier of the sub-database and sub-table, these IDs need to have a certain degree of randomness so that the data after the sub-database will not be uneven. The sequence number can be different at the beginning of each millisecond. Starting from 1, second is starting from any one from 0-9

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



MySQL is an open source relational database management system. 1) Create database and tables: Use the CREATEDATABASE and CREATETABLE commands. 2) Basic operations: INSERT, UPDATE, DELETE and SELECT. 3) Advanced operations: JOIN, subquery and transaction processing. 4) Debugging skills: Check syntax, data type and permissions. 5) Optimization suggestions: Use indexes, avoid SELECT* and use transactions.

You can open phpMyAdmin through the following steps: 1. Log in to the website control panel; 2. Find and click the phpMyAdmin icon; 3. Enter MySQL credentials; 4. Click "Login".

MySQL is an open source relational database management system, mainly used to store and retrieve data quickly and reliably. Its working principle includes client requests, query resolution, execution of queries and return results. Examples of usage include creating tables, inserting and querying data, and advanced features such as JOIN operations. Common errors involve SQL syntax, data types, and permissions, and optimization suggestions include the use of indexes, optimized queries, and partitioning of tables.

MySQL is chosen for its performance, reliability, ease of use, and community support. 1.MySQL provides efficient data storage and retrieval functions, supporting multiple data types and advanced query operations. 2. Adopt client-server architecture and multiple storage engines to support transaction and query optimization. 3. Easy to use, supports a variety of operating systems and programming languages. 4. Have strong community support and provide rich resources and solutions.

Redis uses a single threaded architecture to provide high performance, simplicity, and consistency. It utilizes I/O multiplexing, event loops, non-blocking I/O, and shared memory to improve concurrency, but with limitations of concurrency limitations, single point of failure, and unsuitable for write-intensive workloads.

MySQL's position in databases and programming is very important. It is an open source relational database management system that is widely used in various application scenarios. 1) MySQL provides efficient data storage, organization and retrieval functions, supporting Web, mobile and enterprise-level systems. 2) It uses a client-server architecture, supports multiple storage engines and index optimization. 3) Basic usages include creating tables and inserting data, and advanced usages involve multi-table JOINs and complex queries. 4) Frequently asked questions such as SQL syntax errors and performance issues can be debugged through the EXPLAIN command and slow query log. 5) Performance optimization methods include rational use of indexes, optimized query and use of caches. Best practices include using transactions and PreparedStatemen

MySQL and SQL are essential skills for developers. 1.MySQL is an open source relational database management system, and SQL is the standard language used to manage and operate databases. 2.MySQL supports multiple storage engines through efficient data storage and retrieval functions, and SQL completes complex data operations through simple statements. 3. Examples of usage include basic queries and advanced queries, such as filtering and sorting by condition. 4. Common errors include syntax errors and performance issues, which can be optimized by checking SQL statements and using EXPLAIN commands. 5. Performance optimization techniques include using indexes, avoiding full table scanning, optimizing JOIN operations and improving code readability.

Building an SQL database involves 10 steps: selecting DBMS; installing DBMS; creating a database; creating a table; inserting data; retrieving data; updating data; deleting data; managing users; backing up the database.
