How to insert random string data in mysql_MySQL
Application scenarios:
Sometimes it is necessary to test records inserted into the database for testing, so it is very necessary to use these scripts.
Create table:
1 2 3 4 5 |
|
Create a function that generates random strings:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
|
Create the procedure for inserting the table, where x starts. y is the end value, z is the number of random numbers generated
1 2 3 4 5 6 7 8 |
|
mysql random data generation and insertion
There is very little citation information in the dblp database, with an average of 0.2 citations per paper. A paper using dblp as an experimental data set mentioned that citation information can be added randomly. Inspired by this, I planned to add 20 random citations to each paper, so I wrote the following SQL statement:
String sql = "insert into citation(pId1,pId2) values( (select pId from papers limit ?,1),(select pId from papers limit ?,1))";
Use preparedstatement to submit the database in batch mode.
The first parameter is the rowid information of the paper, from 0 to N (N is the total row of papers). The second parameter is 20 non-repeating random numbers generated by Java, ranging from 0-N. Then nested in a for loop, every 10,000 pieces of data are submitted to the database.
This code cleverly uses the limit feature to randomly select tuples, which is secretly satisfying. I thought that all the selections were done by the database, eliminating the need for multiple connections through jdbc, and it should be able to be completed quickly. Unexpectedly, it took as much as 22 minutes to insert only 100,000 pieces of data (10000*10). The final experiment requires inserting 4 million pieces of data, which means it will take about 14 hours.
So I started to reflect and kept writing similar programs to find the time bottleneck, and finally locked in the select limit. This operation is very time-consuming. The reason for selecting limit at the beginning is that numbers are randomly generated and the numbers need to be mapped to tuples, that is, to rowids. Since the primary key of the papers table is not an incrementing int, the default rowid does not exist. Then I thought, I could add a temp column of auto_increment to the papers table first, and then delete it after completing the citation insertion. In this way, the sql statement is changed to:
String sql = "insert into citation(pId1,pId2) values((select pId from papers where temp=?), (select pId from papers where temp=?))";
Insert 100,000 pieces of data again, which takes 38 seconds. The efficiency has been greatly improved, but I don’t know if it can be further optimized.

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

This article explores optimizing MySQL memory usage in Docker. It discusses monitoring techniques (Docker stats, Performance Schema, external tools) and configuration strategies. These include Docker memory limits, swapping, and cgroups, alongside

This article addresses MySQL's "unable to open shared library" error. The issue stems from MySQL's inability to locate necessary shared libraries (.so/.dll files). Solutions involve verifying library installation via the system's package m

The article discusses using MySQL's ALTER TABLE statement to modify tables, including adding/dropping columns, renaming tables/columns, and changing column data types.

This article compares installing MySQL on Linux directly versus using Podman containers, with/without phpMyAdmin. It details installation steps for each method, emphasizing Podman's advantages in isolation, portability, and reproducibility, but also

This article provides a comprehensive overview of SQLite, a self-contained, serverless relational database. It details SQLite's advantages (simplicity, portability, ease of use) and disadvantages (concurrency limitations, scalability challenges). C

Article discusses configuring SSL/TLS encryption for MySQL, including certificate generation and verification. Main issue is using self-signed certificates' security implications.[Character count: 159]

This guide demonstrates installing and managing multiple MySQL versions on macOS using Homebrew. It emphasizes using Homebrew to isolate installations, preventing conflicts. The article details installation, starting/stopping services, and best pra

Article discusses popular MySQL GUI tools like MySQL Workbench and phpMyAdmin, comparing their features and suitability for beginners and advanced users.[159 characters]
