PHP curl module: simulating a login and collecting a page (example)

WBOY
Release: 2016-07-13 10:43:37

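The curl module in PHP is a binding to the libcurl library, which can issue requests over HTTP and other protocols. It makes simulating a login straightforward, using either POST or GET requests. Below is an example that uses the curl module to log in to a site and then collect a page that is only available after login.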

The task I received today was to fetch product inventory from a website that requires a login. I first tried sending the complete header through fsockopen, without success, so I turned to curl.

First, here is how to enable the curl module:

(1) Copy libeay32.dll and ssleay32.dll from the PHP directory to the Windows directory.
(2) Open php.ini, search for "extension_dir = xxxxx", and confirm that the directory it names contains php_curl.dll.
(3) Also in php.ini, look for "extension=php_curl.dll" and confirm that it is not commented out (there is no ';' in front).
(4) Restart Apache. If calling curl_init() produces an error, the module was not enabled successfully.
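
The check in step (4) can also be done without triggering an error. The following is a minimal sketch; the helper name curl_is_available() is made up for illustration and is not part of PHP.

```php
<?php
// Hypothetical helper: reports whether the curl extension is loaded and usable.
function curl_is_available(): bool
{
    return extension_loaded('curl') && function_exists('curl_init');
}

if (curl_is_available()) {
    $info = curl_version(); // array describing the linked libcurl build
    echo 'curl is enabled, libcurl version ' . $info['version'] . PHP_EOL;
} else {
    echo 'curl is not enabled; check extension_dir and extension=php_curl.dll in php.ini' . PHP_EOL;
}
```

Running this from the command line and through Apache can give different answers, because the two may read different php.ini files.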

Example

The code is as follows:

$curl = curl_init();
$cookie_jar = tempnam('./tmp', 'cookie'); // temporary file that will hold the session cookies
curl_setopt($curl, CURLOPT_URL, 'http://www.bKjia.c0m/checkUser.jsp'); // URL of the page that processes the login
curl_setopt($curl, CURLOPT_POST, 1);
$request = 'user=xxx&password=xxx';
curl_setopt($curl, CURLOPT_POSTFIELDS, $request); // form data to send
curl_setopt($curl, CURLOPT_COOKIEJAR, $cookie_jar); // save the cookies returned by the server in the $cookie_jar file
curl_setopt($curl, CURLOPT_RETURNTRANSFER, 1); // return the response as a string instead of printing it
curl_setopt($curl, CURLOPT_HEADER, false); // do not include the response headers in the output
curl_setopt($curl, CURLOPT_NOBODY, false); // do include the page body in the output
curl_exec($curl); // send the login request
curl_close($curl); // close the handle so the cookies are written to $cookie_jar

$curl2 = curl_init();
curl_setopt($curl2, CURLOPT_URL, 'http://www.bKjia.c0m/aaa.php'); // page to fetch after logging in
curl_setopt($curl2, CURLOPT_HEADER, false);
curl_setopt($curl2, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($curl2, CURLOPT_COOKIEFILE, $cookie_jar); // send the cookies saved during login
$content = curl_exec($curl2);
curl_close($curl2);
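
The example above does not check whether either request succeeded. As a hedged sketch, a small wrapper can turn a failed transfer into an exception. The function name exec_or_fail() is hypothetical, and the URL deliberately uses a made-up scheme so that the transfer fails without touching the network.

```php
<?php
/**
 * Hypothetical helper: run a prepared curl handle and return the body,
 * or throw if the transfer failed. curl_exec() returns false on failure,
 * so the result must be compared with ===.
 */
function exec_or_fail($handle): string
{
    curl_setopt($handle, CURLOPT_RETURNTRANSFER, true);
    $body = curl_exec($handle);
    if ($body === false) {
        throw new RuntimeException(
            'curl error ' . curl_errno($handle) . ': ' . curl_error($handle)
        );
    }
    return $body;
}

// Usage: libcurl does not know this scheme, so curl_exec() fails immediately.
$probe = curl_init('nosuchscheme://example.invalid/');
try {
    $content = exec_or_fail($probe);
} catch (RuntimeException $e) {
    echo $e->getMessage() . PHP_EOL;
}
```

The same wrapper could be applied to both the login request and the follow-up request, so that a failed login is noticed before the second page is fetched.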

This leaves the retrieved page in $content as a string. The next step is to process that string and remove the parts that are not needed. I only removed the unneeded part at the top of the page, by finding the first occurrence of a marker and dropping everything before it:

$content = strstr($content,'
');// find the first occurrence and drop everything before it
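
To make the trimming step concrete, here is a minimal sketch on a made-up page. The sample HTML, the marker string and the helper name trim_before() are all assumptions for illustration; the marker actually used on the target site is not known.

```php
<?php
// Hypothetical helper: keep the page from the first occurrence of $marker onward.
// strstr() returns false when the marker is absent, so fall back to the full page.
function trim_before(string $page, string $marker): string
{
    $tail = strstr($page, $marker);
    return $tail === false ? $page : $tail;
}

// Invented sample page; a real page would come from curl_exec().
$page = '<html><body><div id="nav">menu</div><table id="stock"><tr><td>42</td></tr></table></body></html>';
echo trim_before($page, '<table id="stock">') . PHP_EOL;
// prints: <table id="stock"><tr><td>42</td></tr></table></body></html>
```

Checking for false matters here: assigning the raw result of strstr() would silently replace the page with false if the site changed its markup.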

Detailed explanation of CURL parameters:

bool curl_setopt (resource ch, int option, mixed value)

The curl_setopt() function sets an option for a CURL session. The option parameter names the setting you want to change (one of the CURLOPT_* constants), and value is the value to assign to that option.
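
As a short sketch of how the call is used, the snippet below sets a few options and checks the boolean result. The URL is a placeholder, and no request is actually sent.

```php
<?php
$ch = curl_init();

// Each call sets one option and returns true on success.
$ok = curl_setopt($ch, CURLOPT_URL, 'http://example.com/'); // placeholder URL
$ok = $ok && curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);

// curl_setopt_array() sets several options at once and also returns a bool.
$ok = $ok && curl_setopt_array($ch, [
    CURLOPT_HEADER  => false,
    CURLOPT_TIMEOUT => 10,
]);

var_dump($ok);
```

The option names are integer constants defined by the extension, which is why they are written without quotes.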

The values of the following options are treated as long integers (the option itself is given in the option parameter):

*CURLOPT_INFILESIZE: When you upload a file to a remote site, this option tells PHP the size of the file you are uploading.
*CURLOPT_VERBOSE: If you want CURL to report everything that is happening, set this option to a non-zero value.
*CURLOPT_HEADER: If you want the header included in the output, set this option to a non-zero value.
*CURLOPT_NOPROGRESS: If you do not want PHP to display a progress meter for CURL transfers, set this option to a non-zero value.

Note: PHP automatically sets this option to a non-zero value; you should only change it for debugging purposes.

*CURLOPT_NOBODY: If you do not want to include the body part in the output, set this option to a non-zero value.
*CURLOPT_FAILONERROR: If you want PHP to fail silently when the HTTP status code returned is 400 or greater, set this option to a non-zero value. The default behavior is to return the page normally and ignore the code.
*CURLOPT_UPLOAD: If you want PHP to prepare for an upload, set this option to a non-zero value.
*CURLOPT_POST: If you want PHP to do a regular HTTP POST, set this option to a non-zero value. This POST is of the ordinary application/x-www-form-urlencoded type, the kind most commonly used by HTML forms.
*CURLOPT_FTPLISTONLY: Set this option to a non-zero value and PHP will list only the names in an FTP directory.
*CURLOPT_FTPAPPEND: Set this option to a non-zero value and PHP will append to the remote file instead of overwriting it.
*CURLOPT_NETRC: Set this option to a non-zero value and PHP will look in your ~/.netrc file for the username and password of the remote site you want to connect to.
*CURLOPT_FOLLOWLOCATION: Set this option to a non-zero value to follow any "Location: " header that the server sends as part of the HTTP header (note that this is recursive: PHP will follow as many "Location: " headers as it is sent).
*CURLOPT_PUT: Set this option to a non-zero value to upload a file using HTTP PUT. The file to upload must be set with the CURLOPT_INFILE and CURLOPT_INFILESIZE options.
*CURLOPT_MUTE: Set this option to a non-zero value and PHP will be completely silent with regard to the CURL functions.
*CURLOPT_TIMEOUT: Pass a long integer containing the maximum time, in seconds, that the CURL functions are allowed to take.
*CURLOPT_LOW_SPEED_LIMIT: Pass a long integer containing the transfer speed, in bytes per second, below which the transfer is considered too slow. If the speed stays below this limit for CURLOPT_LOW_SPEED_TIME seconds, PHP aborts the transfer.
*CURLOPT_LOW_SPEED_TIME: Pass a long integer containing the number of seconds the transfer may stay below CURLOPT_LOW_SPEED_LIMIT before PHP considers it too slow and aborts.
*CURLOPT_RESUME_FROM: Pass a long parameter containing the byte offset at which you want the transfer to start.
*CURLOPT_SSLVERSION: Pass a long parameter containing the SSL version to use. By default PHP tries to determine this itself, although in some cases you must set it manually.
*CURLOPT_TIMECONDITION: Pass a long parameter to specify how CURLOPT_TIMEVALUE is treated. You can set this parameter to CURL_TIMECOND_IFMODSINCE or CURL_TIMECOND_IFUNMODSINCE. This applies to HTTP only.
*CURLOPT_TIMEVALUE: Pass a long parameter containing a time in seconds since January 1, 1970. The time is used as specified by the CURLOPT_TIMECONDITION option, or as CURL_TIMECOND_IFMODSINCE by default.
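
Several of the integer options above are typically combined on one handle. The following is a sketch only: the URL and the specific numbers are arbitrary choices for illustration, and the handle is configured but never executed.

```php
<?php
$ch = curl_init('http://example.com/file.zip'); // placeholder URL

$options = [
    CURLOPT_FOLLOWLOCATION  => 1,     // follow "Location: " redirects
    CURLOPT_FAILONERROR     => 1,     // treat HTTP error statuses as failures
    CURLOPT_TIMEOUT         => 30,    // give up after 30 seconds in total
    CURLOPT_LOW_SPEED_LIMIT => 1024,  // below 1024 bytes per second...
    CURLOPT_LOW_SPEED_TIME  => 10,    // ...for 10 seconds counts as too slow
    CURLOPT_TIMECONDITION   => CURL_TIMECOND_IFMODSINCE,
    CURLOPT_TIMEVALUE       => time() - 86400, // only if modified in the last day
];

$applied = curl_setopt_array($ch, $options);
var_dump($applied);
```

Here CURLOPT_LOW_SPEED_LIMIT and CURLOPT_LOW_SPEED_TIME work as a pair, as do CURLOPT_TIMECONDITION and CURLOPT_TIMEVALUE; setting only one of each pair has little effect.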

The values of the following options are treated as strings:

*CURLOPT_URL: This is the URL you want PHP to fetch. You can also set this option when initializing the session with the curl_init() function.
*CURLOPT_USERPWD: Pass a string in the form [username]:[password] for PHP to use for the connection.
*CURLOPT_PROXYUSERPWD: Pass a string in the form [username]:[password] to connect to the HTTP proxy.
*CURLOPT_RANGE: Pass the range you want to retrieve. It should be in "X-Y" format, where either X or Y may be left out. HTTP transfers also support several ranges, separated by commas (X-Y,N-M).
*CURLOPT_POSTFIELDS: Pass a string containing all the data to send in an HTTP "POST" operation.
*CURLOPT_REFERER: Pass a string containing the "referer" header to use in the HTTP request.
*CURLOPT_USERAGENT: Pass a string containing the "user-agent" header to use in the HTTP request.
*CURLOPT_FTPPORT: Pass a string containing the value used to obtain the IP address for the FTP "PORT" instruction. The PORT instruction tells the remote server to connect to the IP address we specify. The string can be an IP address, a host name, a network interface name (under UNIX), or '-' (use the system default IP address).
*CURLOPT_COOKIE: Pass a string containing the content of the cookie to send in the HTTP header.
*CURLOPT_SSLCERT: Pass a string containing the file name of a PEM-formatted certificate.
*CURLOPT_SSLCERTPASSWD: Pass a string containing the password required to use the CURLOPT_SSLCERT certificate.
*CURLOPT_COOKIEFILE: Pass a string containing the name of the file that holds the cookie data. The cookie file can be in Netscape format, or simply HTTP-style headers dumped into a file.
*CURLOPT_CUSTOMREQUEST: Pass a string to be used instead of GET or HEAD when making an HTTP request. This is useful for doing DELETE or another, more obscure, HTTP request.

Note: Do not do this without first making sure that your server supports the command.
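
For the string options, the POST body is the one most easily gotten wrong, because special characters in field values must be URL-encoded. A minimal sketch, assuming invented field names and values, a placeholder URL and a made-up user-agent string:

```php
<?php
// http_build_query() URL-encodes each value, so '&' and spaces are safe.
$fields = ['user' => 'alice', 'password' => 'p&ss word']; // invented credentials
$body = http_build_query($fields);
echo $body . PHP_EOL; // user=alice&password=p%26ss+word

$ch = curl_init();
$ok = curl_setopt_array($ch, [
    CURLOPT_URL        => 'http://example.com/login', // placeholder URL
    CURLOPT_POST       => 1,
    CURLOPT_POSTFIELDS => $body,
    CURLOPT_USERAGENT  => 'ExampleFetcher/1.0',       // made-up user agent
    CURLOPT_REFERER    => 'http://example.com/',
]);
var_dump($ok);
```

Writing the string by hand, as in 'user=xxx&password=xxx', works only as long as the values contain no characters such as '&', '=' or spaces.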

The following options require a file handle (obtained using the fopen() function):

*CURLOPT_FILE: The file where the output of the transfer is written. The default is STDOUT.
*CURLOPT_INFILE: The file the transfer reads from when uploading.
*CURLOPT_WRITEHEADER: The file where the header part of the output is written.
*CURLOPT_STDERR: The file where errors are written instead of stderr.
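
As an illustration of the file-handle options, the sketch below streams a response into a file with CURLOPT_FILE. The function name download_to_file() is hypothetical. A file:// URL pointing at a temporary file stands in for a remote page so the sketch runs without a network; this assumes Unix-style paths, a libcurl build with file:// support, and no open_basedir restriction.

```php
<?php
// Hypothetical helper: stream the response body of $url into the file at $path.
function download_to_file(string $url, string $path): bool
{
    $out = fopen($path, 'wb');
    if ($out === false) {
        return false;
    }
    $ch = curl_init($url);
    curl_setopt($ch, CURLOPT_FILE, $out); // body goes to $out instead of STDOUT
    $ok = curl_exec($ch);                 // true on success when no string is returned
    unset($ch);                           // release the handle before closing the file
    fclose($out);
    return $ok === true;
}

// Local stand-in for a remote page.
$source = tempnam(sys_get_temp_dir(), 'src');
file_put_contents($source, "stock: 42\n");
$target = tempnam(sys_get_temp_dir(), 'dst');

if (download_to_file('file://' . $source, $target)) {
    echo file_get_contents($target);
}
```

Writing straight to a file avoids holding a large response in memory, which is the main reason to prefer CURLOPT_FILE over CURLOPT_RETURNTRANSFER for big downloads.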

source:php.cn