libcurl - php的curl里面在获取页面html数据的时候能指定获取的字节数吗?
RT
本来用的 fopen + fread($fp,读取字节数) 获取数据 SAE 不支持 就想改为curl
我只需要匹配出来title的值就行 去文件的前 800字节就ok了,curl参数众多,不知道该设置哪个。
毕竟获取整个html文件会消耗大量时间,只要前800字节就行,这样应该会节省点时间吧,我用microtime 测试出来的时间差别不大但是还是有差别的
回复内容:
RT
本来用的 fopen + fread($fp,读取字节数) 获取数据 SAE 不支持 就想改为curl
我只需要匹配出来title的值就行 去文件的前 800字节就ok了,curl参数众多,不知道该设置哪个。
毕竟获取整个html文件会消耗大量时间,只要前800字节就行,这样应该会节省点时间吧,我用microtime 测试出来的时间差别不大但是还是有差别的
cURL有一个range选项,计量单位是字节,可以通过如下方式来设定:
<code>curl_setopt($ch, CURLOPT_RANGE, '0-799'); </code>
但是这个不一定管用的,它只是发送了一个请求头,具体如何返回数据还是由发送方决定的,如果发送方支持分片返回则会生效,否则还是完整返回。通过stream也可以实现,也是发送range的头信息,所以结果应该是一样的:
<code>$context = stream_context_create(array('http' => array ('header'=> 'Range: bytes=0-799'))); $data = file_get_contents("http://example.com/file.html", FALSE, $context); </code>
关于range header的rfc文档:http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html#sec14.35
可以这样..不过每次读取,有可能会超过你规定的数值,判断一下就好了.
<?php error_reporting(E_ALL); $data = ''; $url = 'http://segmentfault.com/q/1010000000482129'; $ch = curl_init(); curl_setopt_array($ch, array( CURLOPT_URL => $url, CURLOPT_WRITEFUNCTION => 'receivePartial', )); curl_exec($ch); curl_close($ch); function receivePartial($ch, $chunk) { global $data; $data .= $chunk; $len = strlen($chunk); echo 'had receive ', $len, ' bytes', PHP_EOL; //判断每次读取,如果总数大于1000,就不再往下读了. if (strlen($data) >= 1000) { return -1; } //返回值是告知CURL,是否已够了,要不要再读啦. return $len; } echo $data;
根据你只想取得页面 title
的需求,使用 file_get_contents
函数是不是更合适?
<code>$content = file_get_contents('http://www.baidu.com', false, null, -1, 800); if(mb_detect_encoding($content) == 'GB2312') $content = iconv('GB2312', 'UTF-8', $content); preg_match("/<title>.*<\/title>/", $content, $title); </code>

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

To work on file upload we are going to use the form helper. Here, is an example for file upload.

In this chapter, we are going to learn the following topics related to routing ?

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

Validator can be created by adding the following two lines in the controller.

Working with database in CakePHP is very easy. We will understand the CRUD (Create, Read, Update, Delete) operations in this chapter.
