Sharding - Java backend code implementation and best practices after splitting databases and tables

Question

In our current business, some tables keep growing and the read pressure is very high (write demand is relatively small), so on the database side we decided to split some of the tables with particularly large amounts of data into multiple tables. However, many queries in the back-end code need to join these tables. What do you do in this situation...

大家讲道理 · Answer

You can consider introducing database middleware:
sharding-jdbc (client level)
mycat-server (server level)

世界只因有你 · Answer

A friend recommended Spark, which supports SQL-style queries and reportedly returns results in about 0.5 seconds over 100 million rows

ringa_lee · Answer

Speaking only for the current situation in our project: when splitting tables, each row is routed to a specific table by a hash algorithm. When reading, we first compute which table holds the data using the same algorithm, and then run a normal SELECT against that table
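
The routing described above can be sketched roughly as follows. This is only a minimal illustration under assumptions: the class name `ShardRouter`, the base table name `orders`, the key column `user_id`, and the shard count of 8 are all hypothetical and not taken from the project mentioned in this answer, and a simple modulo stands in for whatever hash algorithm is really used.

```java
// Minimal sketch of hash-based table routing (all names are hypothetical).
final class ShardRouter {
    private final String baseTable;
    private final int shardCount;

    ShardRouter(String baseTable, int shardCount) {
        if (shardCount <= 0) {
            throw new IllegalArgumentException("shardCount must be positive");
        }
        this.baseTable = baseTable;
        this.shardCount = shardCount;
    }

    // Map a sharding key to a shard index in [0, shardCount).
    // floorMod keeps the result non-negative even for negative keys.
    int shardIndex(long key) {
        return (int) Math.floorMod(key, (long) shardCount);
    }

    // Physical table name for a key, e.g. "orders_2".
    String tableFor(long key) {
        return baseTable + "_" + shardIndex(key);
    }

    // Build an ordinary single-table SELECT against the routed table.
    // The column name "user_id" is an assumption for illustration.
    String selectByKey(long key) {
        return "SELECT * FROM " + tableFor(key) + " WHERE user_id = ?";
    }
}
```

The same router has to be used for both writes and reads, otherwise a row written to one table would be looked up in another. Changing the shard count later changes the mapping, so existing data would need to be migrated.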

漂亮男人 · Answer

Join queries are not recommended:
1. Database resources are relatively precious, and join queries take up a lot of memory, which degrades database performance
2. Joins do not work across multiple database instances, so a split-database setup cannot be handled and scalability is poor

The common approach is to break the join query into multiple single-table queries and then combine the results in the application.
1. It solves the problems with join queries listed above
2. With multiple queries, the intermediate results of each query can also be processed in the program, which adds flexibility
3. The application tier can also be scaled out at any time, which makes this approach more flexible
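
As a rough sketch of combining single-table results in the application: the code below takes orders that were already fetched with a single-table query, collects the distinct user IDs, looks the users up in one batch, and merges the two result sets in memory. The names `AppSideJoin`, `Order`, `OrderView`, and the `userLookup` function are hypothetical. In a real service `userLookup` would presumably run something like `SELECT id, name FROM user WHERE id IN (...)`, but that is an assumption.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

// Minimal sketch of an application-side join (all names are hypothetical).
final class AppSideJoin {
    static final class Order {
        final long orderId;
        final long userId;
        Order(long orderId, long userId) {
            this.orderId = orderId;
            this.userId = userId;
        }
    }

    static final class OrderView {
        final long orderId;
        final String userName;
        OrderView(long orderId, String userName) {
            this.orderId = orderId;
            this.userName = userName;
        }
    }

    // orders: result of the first single-table query.
    // userLookup: second single-table query, taking a batch of IDs and
    // returning id -> name for the users that exist.
    static List<OrderView> joinOrdersWithUsers(
            List<Order> orders,
            Function<Set<Long>, Map<Long, String>> userLookup) {
        // Collect distinct keys so the second query runs once, not per row.
        Set<Long> userIds = new LinkedHashSet<>();
        for (Order o : orders) {
            userIds.add(o.userId);
        }
        Map<Long, String> names = userIds.isEmpty()
                ? Collections.<Long, String>emptyMap()
                : userLookup.apply(userIds);

        // Merge in memory; orders without a matching user are dropped,
        // which mirrors INNER JOIN semantics.
        List<OrderView> result = new ArrayList<>(orders.size());
        for (Order o : orders) {
            String name = names.get(o.userId);
            if (name != null) {
                result.add(new OrderView(o.orderId, name));
            }
        }
        return result;
    }
}
```

Batching the IDs into one lookup avoids issuing one query per order row. If the user table is itself sharded, the lookup could group the IDs by shard and run one query per shard.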

For offline scenarios, a MapReduce (MR) framework such as Hadoop is recommended. In that case, the data needs to be written to HDFS.

欧阳克 · Answer

http://blog.csdn.net/tianyale...
Detailed explanation of splitting databases and tables