python - What language do companies usually use for web crawling and scraping?
阿神 2017-04-17 17:48:02

What language do companies usually use for web crawling and scraping? When I search for books on JD.com, they are all about Java.

All replies (30)
刘奇

scrapy +1

It is very convenient to use, has a lot of features, and the documentation is very clear:

scrapy official website

小葫芦

The asker already tagged the question python himself, so why still ask about the language...

PHPzhong

The company I work for uses Java.

黄舟

Parsing a page with a browser or a browser-like method is far slower than parsing it with regular expressions. If you want to use selectors, something has to build the document tree first, and that is not cheap.
However, the biggest problem with regex parsing is that once the site changes its layout, you have to go back and change your patterns.
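
A minimal sketch of that trade-off, using only Python's standard library. The sample HTML, the `price` class name, and the helper names are hypothetical and chosen for illustration. The regex is written against the exact old markup, so it stops matching after a small redesign, while the parser-based version still finds the value.

```python
import re
from html.parser import HTMLParser

# Hypothetical page fragments: the same data before and after a site redesign.
OLD_HTML = '<div><span class="price">19.99</span></div>'
NEW_HTML = '<div><span id="p1" class="price big">19.99</span></div>'

# Regex approach: no tree to build, but tied to the exact old markup.
PRICE_RE = re.compile(r'<span class="price">([^<]+)</span>')


def price_by_regex(html):
    match = PRICE_RE.search(html)
    return match.group(1) if match else None


class PriceParser(HTMLParser):
    """Collects the text of every <span> whose class list contains 'price'."""

    def __init__(self):
        super().__init__()
        self._in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "span" and "price" in classes:
            self._in_price = True

    def handle_endtag(self, tag):
        if tag == "span":
            self._in_price = False

    def handle_data(self, data):
        if self._in_price:
            self.prices.append(data.strip())


def price_by_parser(html):
    parser = PriceParser()
    parser.feed(html)
    return parser.prices[0] if parser.prices else None
```

Here `price_by_regex(NEW_HTML)` finds nothing because the attribute order and class list changed, while `price_by_parser` handles both versions at the cost of walking the whole document.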

迷茫

nodejs +1

PHPzhong

I know a lot about python, but occasionally I use java

Ty80

I have used nokogiri when writing Ruby, but if you care about efficiency, Python is more convenient

PHPzhong

node +1

大家讲道理

The language is not the issue. For the actual work, what matters is the libraries: you need a good HTTP library, a good concurrency library, a good job scheduling library, and a good markup parsing library. If all of those are available, the language performs well, and the syntax is reasonably clean, then it comes down to whether most people in the company can accept the language. Broadly speaking, Python, Java, Ruby, Node.js, and C# all meet these conditions. As for how to choose among them, it depends on those conditions.
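
As a rough illustration of how three of those pieces fit together, here is a sketch in Python using only the standard library: `urllib.request` for HTTP, `concurrent.futures` for concurrency, and `html.parser` for markup parsing. Job scheduling is left out. The function names (`fetch`, `parse_title`, `crawl`) and the `fetcher` parameter are assumptions made for this example, not part of any particular crawler framework.

```python
from concurrent.futures import ThreadPoolExecutor
from html.parser import HTMLParser
from urllib.request import urlopen


class TitleParser(HTMLParser):
    """Markup parsing: collect the text inside <title>."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def fetch(url):
    """HTTP: download a page body as text (makes a real network call)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def parse_title(html):
    parser = TitleParser()
    parser.feed(html)
    return parser.title.strip()


def crawl(urls, fetcher=fetch, workers=4):
    """Concurrency: fetch pages in a thread pool, then parse each one."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        pages = list(pool.map(fetcher, urls))  # results keep the order of urls
    return {url: parse_title(html) for url, html in zip(urls, pages)}
```

Passing the fetcher in as a parameter keeps the sketch testable without network access; any callable that maps a URL to an HTML string will do.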

洪涛

We wrote it in ruby
