Home Backend Development PHP Tutorial Sharing tips on crawling web content with PHP_PHP Tutorial

Sharing tips on crawling web content with PHP_PHP Tutorial

Jul 15, 2016 pm 01:32 PM
php Why content share reaction Can how accomplish talent Skill crawl correct of Web page

How to implement it correctlyBut why does PHP not respond after crawling the web content? There is no test text. If I put echo "test"; on the first line, it can be output. I guess the curl_init() function has not been run yet!

See if there is CURL extension support in PHP's phpinfo()!

Copy php_curl.dll to c:windows and c:windowssystem32, restart apache and try again

It is not the file php_curl.dll, but copy libeay32.dll and ssleay32.dll in the php directory to c:windowssystem32 and restart apache

For the sake of server security, allow_url_fopen is turned off.

When the server allow_url_fopen = Off, file_get_contents cannot be used. It can only be used when it is set to ON.

<ol class="dp-xml">
<li class="alt"><span><span class="tag"><</span><span> ?php /*  </span></span></li><li><span>$</span><span class="attribute">getstr</span><span>=</span><span class="attribute-value">file_get_contents</span><span>("http://www.<br />163.com/weatherxml/54511.xml");  </span></li><li class="alt"><span>$</span><span class="attribute">qx</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($getstr,"</span><span class="attribute">qx</span><span>="));  </span></li><li><span>$</span><span class="attribute">wd</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($getstr,"</span><span class="attribute">wd</span><span>="));  </span></li><li class="alt"><span>$</span><span class="attribute">qximg</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($getstr,"</span><span class="attribute">qximg</span><span>="));  </span></li><li><span>$</span><span class="attribute">qximg_</span><span>=</span><span class="attribute-value">explode</span><span>(",",$qximg[1]);  </span></li><li class="alt"><span>echo "北京 ".$qx[1]."";  </span></li><li><span>echo $wd[1];*/  </span></li><li class="alt"><span>//echo "</span><span class="tag"><</span><span> </span><span class="tag-name">img</span><span> </span><span class="attribute">src</span><span>=</span><span class="attribute-value">'http://news.<br />163.com/img/logo/".$qximg_[0]."'</span><span class="tag">><br></span><span class="tag"><</span><span> </span><span class="tag-name">img</span><span> </span><span class="attribute">src</span><span>=</span><span class="attribute-value">'http://news.163.com<br />/img/logo/".$qximg_[1]."'</span><span class="tag">></span><span>";  </span></span></li>
<li>
<span class="tag">?></span><span> </span>
</li>
</ol>
Copy after login

The following example of PHP crawling web content is to obtain the 163 weather forecast through the curl_init function

Put php.ini ( ;extension=php_curl.dll ) Remove the previous (;) and save

Copy php_curl.dll, libeay32.dll, ssleay32.dll to c:windowssystem32 and restart IIS. Apache is not installed

<ol class="dp-xml">
<li class="alt"><span><span class="tag"><</span><span> ?php  </span></span></li><li><span>//初始化curl  </span></li><li class="alt"><span>$</span><span class="attribute">ch</span><span> = </span><span class="attribute-value">curl_init</span><span>() or die (curl_error());  </span></li><li><span>//设置URL参数  </span></li><li class="alt"><span>curl_setopt($ch,CURLOPT_URL,"http:<br />//www.163.com/weatherxml/54511.xml");  </span></li><li><span>//要求CURL返回数据  </span></li><li class="alt"><span>curl_setopt($ch,CURLOPT_RETURNTRANSFER,1);  </span></li><li><span>//执行请求  </span></li><li class="alt"><span>$</span><span class="attribute">result</span><span> = </span><span class="attribute-value">curl_exec</span><span>($ch) or die (curl_error());  </span></li><li><span>//取得返回的结果,并显示  </span></li><li class="alt"><span>//echo $result;  </span></li><li><span>// echo curl_error($ch);  </span></li><li class="alt"><span>$</span><span class="attribute">qx</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($result,"</span><span class="attribute">qx</span><span>="));  </span></li><li><span>$</span><span class="attribute">wd</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($result,"</span><span class="attribute">wd</span><span>="));  </span></li><li class="alt"><span>$</span><span class="attribute">qximg</span><span>=</span><span class="attribute-value">explode</span><span>(""",strstr($result,"</span><span class="attribute">qximg</span><span>="));  </span></li><li><span>$</span><span class="attribute">qximg_</span><span>=</span><span class="attribute-value">explode</span><span>(",",$qximg[1]);  </span></li><li class="alt"><span>echo "北京 ".$qx[1]."</span><span class="tag"><</span><span> </span><span class="tag-name">br</span><span> </span><span class="tag">/></span><span>";  </span></span></li>
<li><span>echo $wd[1];  </span></li>
<li class="alt"><span>//关闭CURL  </span></li>
<li><span>curl_close($ch);  </span></li>
<li class="alt">
<span class="tag">?></span><span> </span>
</li>
</ol>
Copy after login

Through the above study of PHP crawling web content, you can practice it yourself and deepen your understanding of it.


www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/446179.htmlTechArticleHow to implement it correctly? But why does PHP not respond after crawling the web content? There is no test text. If I put the echo test; it can be output on the first line, I guess...
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Explain JSON Web Tokens (JWT) and their use case in PHP APIs. Explain JSON Web Tokens (JWT) and their use case in PHP APIs. Apr 05, 2025 am 12:04 AM

JWT is an open standard based on JSON, used to securely transmit information between parties, mainly for identity authentication and information exchange. 1. JWT consists of three parts: Header, Payload and Signature. 2. The working principle of JWT includes three steps: generating JWT, verifying JWT and parsing Payload. 3. When using JWT for authentication in PHP, JWT can be generated and verified, and user role and permission information can be included in advanced usage. 4. Common errors include signature verification failure, token expiration, and payload oversized. Debugging skills include using debugging tools and logging. 5. Performance optimization and best practices include using appropriate signature algorithms, setting validity periods reasonably,

Tutorial on how to represent the greatest common divisor in C language functions Tutorial on how to represent the greatest common divisor in C language functions Apr 03, 2025 pm 11:21 PM

Methods to efficiently and elegantly find the greatest common divisor in C language: use phase division to solve by constantly dividing the remainder until the remainder is 0. Two implementation methods are provided: recursion and iteration are concise and clear, and the iterative implementation is higher and more stable. Pay attention to handling negative numbers and 0s, and consider performance optimization, but the phase division itself is efficient enough.

What is the priority of C language !x? What is the priority of C language !x? Apr 03, 2025 pm 02:06 PM

The logical non-operator (!) has the priority next to parentheses, which means that in expressions, it will precede most other operators. Understanding priority not only requires rote memorization, but more importantly, understanding the logic and potential pitfalls behind it to avoid undetectable errors in complex expressions. Adding brackets can clarify expression intent, improve code clarity and maintainability, and prevent unexpected behavior.

Best practices for C language default Best practices for C language default Apr 03, 2025 pm 03:48 PM

The best practices of default in C language: place it at the end of the switch statement as the default processing for unmatched values; it is used to handle unknown or invalid values ​​to improve program robustness; avoid duplication with case branches to maintain conciseness; comment clearly on the purpose of the default branch to improve readability; avoid using multiple defaults in one case to maintain clarity; keep the default branch concise and avoid complex operations; consider using enumeration values ​​as case conditions to improve maintainability; in large switch statements, use multiple default branches to handle different situations.

Front-end FileReader file reading: Why do you need to instantiate first and then read? Front-end FileReader file reading: Why do you need to instantiate first and then read? Apr 04, 2025 pm 01:48 PM

Regarding FileReader instantiation and file reading In front-end development, we often need to process files uploaded by users. use

Why does my RxJS code not take effect when operating on streams? Why does my RxJS code not take effect when operating on streams? Apr 04, 2025 pm 06:27 PM

Why doesn't my code take effect when using RxJS to operate on streams? Learning RxJS...

Unused variables in C/C: Why and how? Unused variables in C/C: Why and how? Apr 03, 2025 pm 10:48 PM

In C/C code review, there are often cases where variables are not used. This article will explore common reasons for unused variables and explain how to get the compiler to issue warnings and how to suppress specific warnings. Causes of unused variables There are many reasons for unused variables in the code: code flaws or errors: The most direct reason is that there are problems with the code itself, and the variables may not be needed at all, or they are needed but not used correctly. Code refactoring: During the software development process, the code will be continuously modified and refactored, and some once important variables may be left behind and unused. Reserved variables: Developers may predeclare some variables for future use, but they will not be used in the end. Conditional compilation: Some variables may only be under specific conditions (such as debug mode)

Explore file splitting and merging: a revolutionary tool for managing large files Explore file splitting and merging: a revolutionary tool for managing large files Apr 04, 2025 am 07:15 AM

Have you ever had difficulties transferring or storing large amounts of files? Learn about FileSplitter & Merger, an open source project designed to simplify this challenge in an elegant and efficient way. What is this project doing? File segmentation and merging are composed of two tools: File splitter—dividing large files into smaller chunks. File Merge - Why is it useful to recombine these blocks into the original file? Working with very large files can be daunting, especially when transferring or storing them. These tools allow you to split files into manageable fragments and then rebuild the original files from those fragments. File splitter This program splits the input file into smaller pieces of a specified size. Each block is saved as a separate file. Usage: Copy the text to be divided

See all articles