Community

Learn

Tools Library

AI Tools

Leisure

English

Home > Backend Development > PHP Tutorial > 100万条记录的文本文件，取出重复数最多的前10条。

100万条记录的文本文件，取出重复数最多的前10条。

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Release： 2016-06-23 13:26:34

Original

1687 people have browsed it

1. 100万条记录的文本文件，取出重复数最多的前10条。
示例文本：
098
123
234
789
……
234
678
654
123

求思路

回复讨论(解决方案)

导入到表中，然后用sql统计，不知道可行不。你可以试试。

导入到表中，然后用sql统计，不知道可行不。你可以试试。

这样肯定可行，但应该不是出题者想要的解决方法。想要采用PHP处理或算法

explode //读取分割成数组
array_count_values//统计重复次数
arsort//排序，得到结果

可以对文本分块处理，记录结果，估计一次性读取的话，内存也吃不住...

可以对文本分块处理，记录结果，估计一次性读取的话，内存也吃不住...

恩，你的方法靠普，能细说一下么

$fp = fopen('文件', 'r');while($buf = fgets($fp)) {  $res[$buf]++;}fclose($fp);arsort($res);$res = array_keys(array_slice($res, 0, 10));print_r($res);

Copy after login

当100万条记录半数是唯一的情况下，与下面的算法没有多大区别

$a = file('文件');$res = array_count_values($a);arsort($res);$res = array_keys(array_slice($res, 0, 10));print_r($res);

Copy after login

先批量插入到数据库,然后使用 sql 语句的 group by 和order by实现

Related labels：

100万条记录的文本文件，取出重复数最多的前10条。

Previous article：Laravel 5系列教程十：实现文章的修改 Next article：5组数字，每组取一个组成不相同的5位数，大神

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

What is a NullPointerException, and how do I fix it?

2024-10-22 09:46:29
From Novice to Coder: Your Journey Begins with C Fundamentals

2024-10-13 13:53:41
Unlocking Web Development with PHP: A Beginner's Guide

2024-10-12 12:15:51
Demystifying C: A Clear and Simple Path for New Programmers

2024-10-11 22:47:31
Unlock Your Coding Potential: C Programming for Absolute Beginners

2024-10-11 19:36:51
Unleash Your Inner Programmer: C for Absolute Beginners

2024-10-11 15:50:41
Automate Your Life with C: Scripts and Tools for Beginners

2024-10-11 15:07:41
PHP Made Easy: Your First Steps in Web Development

2024-10-11 14:21:21
Build Anything with Python: A Beginner's Guide to Unleashing Your Creativity

2024-10-11 12:59:11
The Key to Coding: Unlocking the Power of Python for Beginners

2024-10-11 12:17:31

Latest Issues

Team collaboration - What should I do if someone needs the feature I wrote as a dependency in git flow?

From 1970-01-01 08:00:00

0

0

0

Objective-c - Constraints for iOS a warning issue

From 1970-01-01 08:00:00

0

0

0

Confusion about using gitlab's fork&pull request mode within the team

From 1970-01-01 08:00:00

0

0

0

Objective-c - In iOS development, Instagram cannot be authorized after logging in. Instagram does not jump back to the application. How to get the callback address?

From 1970-01-01 08:00:00

0

0

0

Version Control - About the use of SVN and GIT in company projects?

From 1970-01-01 08:00:00

0

0

0

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template