Community

Learn

Tools Library

AI Tools

Leisure

English

Home > Operation and Maintenance > Linux Operation and Maintenance > Export database files under Linux for statistics + deduplication

Export database files under Linux for statistics + deduplication

little bottle

Release： 2019-04-19 13:20:08

forward

4025 people have browsed it

This article mainly talks about how to implement database file statistics and deduplication in Linux. Friends who are interested can learn it!

1. Export the database table to a text file

mysql -h host -P port -u user -p password -A database -e "select email,domain,time from ent_login_01_000" > ent_login_01_000.txt

A total of logged-in users in the last 3 months will be counted, divided into tables by month, and there are 128 tables per month, all exported to files, a total of 80G

2. grep finds all 2018-12 2019-01 2019-02

find ./ -type f -name "ent_login_*" | xargs cat |grep "2018-12" > 2018-12.txt
find ./ -type f -name "ent_login_*" |xargs cat |grep "2019-01" > 2019-01.txt
find ./ -type f -name "ent_login_*" |xargs cat |grep "2019-02" > 2019-02.txt

3. Use awk sort and uniq to only remove the previous user, and First go to the duplicate lines

cat 2019-02.txt|awk -F " " '{print $1"@"$2}'|sort -T /mnt/public/phpdev/187_test/tmp/|uniq > 2019-02-awk-sort-uniq.txt

cat 2019-01.txt|awk -F " " '{print $1"@"$2}'|sort -T /mnt/public/ phpdev/187_test/tmp/|uniq > 2019-01-awk-sort-uniq.txt

cat 2018-12.txt|awk -F " " '{print $1"@"$2}'| sort -T /mnt/public/phpdev/187_test/tmp/|uniq > 2018-12-awk-sort-uniq.txt

uniq only removes consecutive duplicate lines, sort can arrange the lines into consecutive The -T is because the temporary directory of /tmp is occupied by default. The root directory is not enough for me, so I changed the temporary directory.

These files occupy more than 100 G

I want to learn more For Linux tutorials, please pay attention to Linux Video Tutorials on the PHP Chinese website!

The above is the detailed content of Export database files under Linux for statistics + deduplication. For more information, please follow other related articles on the PHP Chinese website!

Related labels：

linux Remove duplicates Export file database

Previous article：[Linux] Use the scp command to upload files to the server Next article：Detailed explanation of four commands for viewing logs in real time on Linux

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

What is mysql slow query

2019-05-31 18:00:19
Is the free version of mysql easy to use?

2019-05-31 17:53:44
How to enter mysql

2019-05-31 17:41:15
How to check mysql installation path

2019-05-31 17:32:51
How to use cmd to enter mysql

2019-05-31 17:24:18
What can mysql do?

2019-05-31 17:15:01
what does vue do

2019-05-31 16:58:16
How to use jquery's after method

2019-05-31 16:37:47
What does prop mean in jquery

2019-05-31 16:19:45
What does jq mean?

2019-05-31 16:04:54

Latest Issues

How do I use sudo to grant elevated privileges to users in Linux?

2025-03-17 17:32:12
How do I implement two-factor authentication (2FA) for SSH in Linux?

2025-03-17 17:31:28
How do I monitor system performance in Linux using tools like top, htop, and vmstat?

2025-03-17 17:28:37
How do I manage software packages in Linux using package managers (apt, yum, dnf)?

2025-03-17 17:26:48
How do I use regular expressions (regex) in Linux for pattern matching?

2025-03-17 17:25:31

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Python+artificial intelligence full-stack engineer (Linux basics)

319116
Linux step-by-step video tutorial

75158
Linux Basics Advanced Video Tutorial

48617
Linux development video tutorial

39310
Linux load balancing video tutorial

16452

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template