
Quickly understand the techniques of crawling web content with PHP

Jul 25, 2016 am 08:45 AM

How do you correctly fetch web content with PHP? This can be difficult for anyone who has not worked with PHP for long. Today Kekejie will walk you through a specific solution.


First, I enabled extension=php_curl.dll in the php.ini under C:\Windows and then restarted Apache. Below is the PHP I wrote to fetch web content, in this case Baidu's search results for "php":

<?php
// Initialize a cURL session
$ch = curl_init();

// Set the URL parameter
curl_setopt($ch, CURLOPT_URL, "http://www.baidu.com/s?wd=php");

// Ask cURL to return the data instead of printing it directly
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);

// Execute the request
$result = curl_exec($ch) or die(curl_error($ch));

// Display the returned result
echo $result;
echo curl_error($ch);

// Close cURL
curl_close($ch);
?>

But why is there no response after PHP tries to fetch the web content? Not even test text appears. If I put echo "test"; on the first line of the script, it is output, so my guess is that execution stops at curl_init(), which would mean the cURL extension is not actually loaded.

Check the output of phpinfo() to see whether PHP reports cURL extension support.
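The same check can be done from a script instead of reading through the phpinfo() page. This is a minimal sketch: the helper name curlIsAvailable is made up for illustration, while extension_loaded(), function_exists() and php_ini_loaded_file() are standard PHP functions.

```php
<?php
// Hypothetical helper (not from the original article): reports whether
// the cURL extension is usable in the running PHP process.
function curlIsAvailable(): bool
{
    return extension_loaded('curl') && function_exists('curl_init');
}

// php_ini_loaded_file() returns the path of the php.ini in use,
// or false when no php.ini was loaded.
$iniFile = php_ini_loaded_file();

echo curlIsAvailable() ? "cURL is loaded\n" : "cURL is NOT loaded\n";
echo "php.ini in use: ", ($iniFile === false ? "(none)" : $iniFile), "\n";
```

Printing the php.ini path helps confirm that the file you edited is the one PHP actually reads, which is a common cause of an extension appearing not to load.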

Copy php_curl.dll to c:\windows and c:\windows\system32, restart Apache, and try again.

The problem turned out not to be php_curl.dll itself. Copy libeay32.dll and ssleay32.dll from the PHP directory to c:\windows\system32 and restart Apache.

For the sake of server security, allow_url_fopen is often turned off. When the server has allow_url_fopen = Off, file_get_contents cannot be used to fetch remote URLs; it only works for them when the setting is On.

<?php
/*
// Disabled: file_get_contents() can only fetch URLs when allow_url_fopen = On
$getstr = file_get_contents("http://www.163.com/weatherxml/54511.xml");

// Split on double quotes to pull out the qx, wd and qximg attribute values
$qx = explode('"', strstr($getstr, "qx="));
$wd = explode('"', strstr($getstr, "wd="));
$qximg = explode('"', strstr($getstr, "qximg="));
$qximg_ = explode(",", $qximg[1]);

echo "Beijing " . $qx[1] . "<br />";
echo $wd[1];
*/

//echo "<img src='http://news.163.com/img/logo/" . $qximg_[0] . "'> <img src='http://news.163.com/img/logo/" . $qximg_[1] . "'>";
?>
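Whether file_get_contents can be used for URLs can also be decided at runtime rather than by trial and error. The following is only a sketch under stated assumptions: the helper names urlFopenEnabled and chooseFetchMethod are invented for illustration, and the fallback order (file_get_contents first, then cURL) is one possible choice, not something the original article prescribes.

```php
<?php
// Hypothetical helper: true when file_get_contents() may open http:// URLs.
function urlFopenEnabled(): bool
{
    // ini_get() returns the setting as a string such as "1", "0", "On" or "Off";
    // FILTER_VALIDATE_BOOLEAN normalises those forms to a real boolean.
    return filter_var(ini_get('allow_url_fopen'), FILTER_VALIDATE_BOOLEAN);
}

// Hypothetical helper: names the fetch strategy the current configuration allows.
function chooseFetchMethod(): string
{
    if (urlFopenEnabled()) {
        return 'file_get_contents';
    }
    if (function_exists('curl_init')) {
        return 'curl';
    }
    return 'none';
}

echo "Fetch method: ", chooseFetchMethod(), "\n";
```

A script could branch on the returned string and only call file_get_contents when the setting is on, using the cURL approach otherwise.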

The following example of PHP crawling web content obtains the 163 weather forecast using cURL (curl_init).

In php.ini, remove the leading semicolon (;) from the line ;extension=php_curl.dll and save the file.

Copy php_curl.dll, libeay32.dll, and ssleay32.dll to c:\windows\system32 and restart IIS (Apache is not installed on this machine).

<?php
// Initialize a cURL session
$ch = curl_init() or die("curl_init() failed");

// Set the URL parameter
curl_setopt($ch, CURLOPT_URL, "http://www.163.com/weatherxml/54511.xml");

// Ask cURL to return the data instead of printing it directly
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);

// Execute the request
$result = curl_exec($ch) or die(curl_error($ch));

// Display the raw result (disabled)
//echo $result;
//echo curl_error($ch);

// Split on double quotes to pull out the qx, wd and qximg attribute values
$qx = explode('"', strstr($result, "qx="));
$wd = explode('"', strstr($result, "wd="));
$qximg = explode('"', strstr($result, "qximg="));
$qximg_ = explode(",", $qximg[1]);

echo "Beijing " . $qx[1] . "<br />";
echo $wd[1];

// Close cURL
curl_close($ch);
?>
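The strstr/explode parsing above depends on each value being the first quoted string after its attribute name. As an alternative sketch (a swapped-in technique, not the author's method), a regular expression can pull out a named attribute and report when it is missing. The helper name extractAttribute and the sample string are made up for illustration; the real feed's layout is not shown in the article.

```php
<?php
// Hypothetical helper: returns the value of the first name="value" pair
// found in $text, or null when the attribute is not present.
function extractAttribute(string $text, string $name): ?string
{
    // \b keeps "qx" from matching inside a longer name; preg_quote escapes the name.
    $pattern = '/\b' . preg_quote($name, '/') . '="([^"]*)"/';
    if (preg_match($pattern, $text, $matches) === 1) {
        return $matches[1];
    }
    return null;
}

// Made-up sample in the shape the parsing code above implies.
$sample = '<weather qx="Sunny" wd="25C" qximg="0.gif,1.gif" />';

echo extractAttribute($sample, 'qx'), "\n";
print_r(explode(',', extractAttribute($sample, 'qximg')));
```

Returning null for a missing attribute lets the caller handle an unexpected response explicitly instead of reading an undefined array index.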

Having worked through the examples above, try fetching web content with PHP yourself to deepen your understanding. More related information: http://www.kokojia.com/s64/



