This article summarizes functions commonly used in PHP content-collection (web scraping) scripts, with code for each one.
While working on a PHP collection script over the past few days, I found that PHP makes collecting content from web pages very convenient. The functions I use most often are gathered here for future reference.
Get the text and address of every link
function getAllURL($code){
    // $arr[1] holds the href values, $arr[2] the link texts
    preg_match_all('/<a\s+href=["\']?([^>"\' ]+)["\']?\s*[^>]*>([^>]+)<\/a>/i', $code, $arr);
    return array('name' => $arr[2], 'url' => $arr[1]);
}
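The pattern in getAllURL only matches when href is the first attribute of the tag and the link text contains no nested tags. Where that is too strict, one possible alternative is PHP's DOM extension. The sketch below is not from the original article: the helper name getAllUrlDom is hypothetical, and it assumes the DOM extension is enabled. It returns the same array shape as getAllURL.

```php
<?php
// Hypothetical helper (not part of the original article): same return shape
// as getAllURL(), built on DOMDocument instead of a regular expression.
function getAllUrlDom($code)
{
    $result = array('name' => array(), 'url' => array());
    if (trim($code) === '') {
        return $result; // loadHTML() rejects empty input
    }

    $doc = new DOMDocument();
    // Scraped markup is rarely valid, so collect parser warnings
    // internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($code);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    foreach ($doc->getElementsByTagName('a') as $a) {
        if (!$a->hasAttribute('href')) {
            continue; // skip anchors that have no target
        }
        $result['name'][] = trim($a->textContent);
        $result['url'][]  = $a->getAttribute('href');
    }
    return $result;
}

$html  = '<p><a class="nav" href="/about.html">About <b>us</b></a> '
       . '<a href=\'http://example.com/\'>Home</a></p>';
$links = getAllUrlDom($html);
print_r($links);
```

Note that loadHTML() assumes ISO-8859-1 unless the markup declares a charset, so non-ASCII pages may need an encoding declaration or conversion before parsing.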
Get all image addresses
function getImgSrc($code){
    // Match absolute image URLs (http or https) in the src attribute of <img> tags
    $reg = "/<img[^>]*src=\"(https?:\/\/(.+)\/(.+)\.(jpg|gif|bmp|png))\"/isU";
    preg_match_all($reg, $code, $img_array, PREG_PATTERN_ORDER);
    return $img_array[1];
}
Current script URL
function getSelfURL(){
    if (!empty($_SERVER["REQUEST_URI"])) {
        // REQUEST_URI already includes the query string
        $nowurl = $_SERVER["REQUEST_URI"];
    } else {
        $scriptName = $_SERVER["PHP_SELF"];
        if (empty($_SERVER["QUERY_STRING"])) {
            $nowurl = $scriptName;
        } else {
            $nowurl = $scriptName . "?" . $_SERVER["QUERY_STRING"];
        }
    }
    return $nowurl;
}
Convert full-width numbers to half-width numbers
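function getAlabNum($fnum){
    // Full-width digits and their half-width equivalents
    $nums  = array("０","１","２","３","４","５","６","７","８","９");
    $fnums = "0123456789";
    for ($i = 0; $i <= 9; $i++) {
        $fnum = str_replace($nums[$i], $fnums[$i], $fnum);
    }
    // ereg_replace() was removed in PHP 7, so preg_replace() is used here.
    // Strip everything except digits and dots, plus any leading zeros.
    $fnum = preg_replace('/[^0-9.]|^0+/', '', $fnum);
    if ($fnum == "") $fnum = 0;
    return $fnum;
}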
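If only the digit conversion is needed, without stripping other characters or leading zeros, the mbstring extension offers mb_convert_kana(), whose "n" mode converts full-width digits to half-width. The following is a minimal sketch, not the article's method: the helper name toHalfWidthDigits is hypothetical, UTF-8 input is assumed, and a plain strtr() map is used as a fallback when mbstring is not loaded.

```php
<?php
// Hypothetical helper (not part of the original article).
// Converts full-width digits to half-width and leaves everything else alone.
function toHalfWidthDigits($text)
{
    if (function_exists('mb_convert_kana')) {
        // Mode "n": full-width ("zen-kaku") digits to half-width ("han-kaku").
        return mb_convert_kana($text, 'n', 'UTF-8');
    }
    // Fallback without mbstring: an explicit character map.
    return strtr($text, array(
        '０' => '0', '１' => '1', '２' => '2', '３' => '3', '４' => '4',
        '５' => '5', '６' => '6', '７' => '7', '８' => '8', '９' => '9',
    ));
}

$converted = toHalfWidthDigits('価格１２８０円');
echo $converted, "\n";
```

Unlike getAlabNum, this keeps the surrounding text, which can be useful when the digits still have to be located within a longer string.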
Convert plain text to HTML (escape tags and line breaks)
function text2Html($txt){
    $txt = str_replace(" ", "&nbsp;", $txt);
    $txt = str_replace("<", "&lt;", $txt);
    $txt = str_replace(">", "&gt;", $txt);
    // Turn each run of line breaks into a single <br/>
    $txt = preg_replace("/[\r\n]+/", "<br/>\r\n", $txt);
    return $txt;
}
Escape HTML tags
function clearHtml($str){
    $str = str_replace('<', '&lt;', $str);
    $str = str_replace('>', '&gt;', $str);
    return $str;
}
Convert relative path to absolute path
function relative2Absolute($content, $feed_url) {
    // Split the feed URL into its protocol and host name
    preg_match('/(http|https|ftp):\/\//', $feed_url, $protocol);
    $server_url = preg_replace("/(http|https|ftp|news):\/\//", "", $feed_url);
    $server_url = preg_replace("/\/.*/", "", $server_url);
    if ($server_url == '') {
        return $content;
    }
    if (isset($protocol[0])) {
        // Prefix root-relative href and src values with protocol and host
        $new_content = preg_replace('/href="\//', 'href="'.$protocol[0].$server_url.'/', $content);
        $new_content = preg_replace('/src="\//', 'src="'.$protocol[0].$server_url.'/', $new_content);
    } else {
        $new_content = $content;
    }
    return $new_content;
}
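relative2Absolute rewrites only root-relative values (those starting with "/"), and it also rewrites protocol-relative values such as src="//cdn.example.com/x.js", which then point to the wrong host. As a rough sketch of one way to avoid that, the variant below takes the scheme, host and port from parse_url() and skips values that start with "//". The function name rootRelativeToAbsolute is hypothetical and not from the original article.

```php
<?php
// Hypothetical variant (not part of the original article).
function rootRelativeToAbsolute($content, $pageUrl)
{
    $parts = parse_url($pageUrl);
    if ($parts === false || empty($parts['scheme']) || empty($parts['host'])) {
        return $content; // cannot build a base URL, leave the content unchanged
    }
    $base = $parts['scheme'] . '://' . $parts['host'];
    if (isset($parts['port'])) {
        $base .= ':' . $parts['port'];
    }
    // (?!\/) leaves protocol-relative values such as //cdn.example.com/x.js alone.
    return preg_replace('/\b(href|src)="\/(?!\/)/i', '$1="' . $base . '/', $content);
}

$page  = '<a href="/a/b.html">x</a><img src="/img/logo.png">'
       . '<script src="//cdn.example.com/x.js"></script><a href="c.html">y</a>';
$fixed = rootRelativeToAbsolute($page, 'https://example.com:8080/news/index.html');
echo $fixed, "\n";
```

Like the original function, this leaves document-relative values such as href="c.html" untouched. Resolving those would also require the directory part of the page URL.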
Get the content in the specified tag
function getTagData($str, $start, $end){
    if ($start == '' || $end == '') {
        return;
    }
    $str = explode($start, $str);
    if (!isset($str[1])) {
        return; // start marker not found
    }
    $str = explode($end, $str[1]);
    return $str[0];
}
Convert each row of an HTML table to a CSV-format array
function getTrArray($table) {
    $table = preg_replace("'<td[^>]*?>'si", '"', $table);
    $table = str_replace("</td>", '",', $table);
    $table = str_replace("</tr>", "{tr}", $table);
    // Strip the remaining HTML tags
    $table = preg_replace("'<[\/\!]*?[^<>]*?>'si", "", $table);
    // Strip whitespace
    $table = preg_replace("'([\r\n])[\s]+'", "", $table);
    $table = str_replace(" ", "", $table);
    $table = str_replace("&nbsp;", "", $table);
    $table = explode(",{tr}", $table);
    array_pop($table);
    return $table;
}
Convert the rows and columns of an HTML table to a two-dimensional array
function getTdArray($table) {
    $td_array = array();
    $table = preg_replace("'<table[^>]*?>'si", "", $table);
    $table = preg_replace("'<tr[^>]*?>'si", "", $table);
    $table = preg_replace("'<td[^>]*?>'si", "", $table);
    $table = str_replace("</tr>", "{tr}", $table);
    $table = str_replace("</td>", "{td}", $table);
    // Strip the remaining HTML tags
    $table = preg_replace("'<[\/\!]*?[^<>]*?>'si", "", $table);
    // Strip whitespace
    $table = preg_replace("'([\r\n])[\s]+'", "", $table);
    $table = str_replace(" ", "", $table);
    $table = str_replace("&nbsp;", "", $table);
    $table = explode('{tr}', $table);
    array_pop($table);
    foreach ($table as $key => $tr) {
        $td = explode('{td}', $tr);
        array_pop($td);
        $td_array[] = $td;
    }
    return $td_array;
}
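getTdArray relies on lowercase </td> and </tr> closing tags and removes every space inside a cell. As a hedged alternative, the sketch below reads the cells with the DOM extension. It is not from the original article: the helper name tableToArrayDom is hypothetical, the DOM extension is assumed to be enabled, and nested tables are not handled separately.

```php
<?php
// Hypothetical helper (not part of the original article): returns one array
// per <tr>, each holding the trimmed text of its <td>/<th> cells.
function tableToArrayDom($tableHtml)
{
    $rows = array();
    if (trim($tableHtml) === '') {
        return $rows; // loadHTML() rejects empty input
    }

    $doc = new DOMDocument();
    $previous = libxml_use_internal_errors(true); // silence markup warnings
    $doc->loadHTML($tableHtml);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    foreach ($doc->getElementsByTagName('tr') as $tr) {
        $cells = array();
        foreach ($tr->childNodes as $cell) {
            // Only direct <td>/<th> children count as cells of this row.
            if ($cell instanceof DOMElement
                && in_array(strtolower($cell->nodeName), array('td', 'th'), true)) {
                $cells[] = trim($cell->textContent);
            }
        }
        $rows[] = $cells;
    }
    return $rows;
}

$rows = tableToArrayDom(
    '<table><tr><th>Name</th><th>Qty</th></tr>'
    . '<TR><TD>Green apple</TD><TD> 3 </TD></TR></table>'
);
print_r($rows);
```

Here spaces inside a cell ("Green apple") are kept and only the surrounding whitespace is trimmed, which differs from getTdArray.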
Return all words in a string (set $distinct = true to remove duplicates)
function splitEnStr($str, $distinct = true) {
    // A "word" here is any run of ASCII letters
    preg_match_all('/([a-zA-Z]+)/', $str, $match);
    if ($distinct == true) {
        $match[1] = array_unique($match[1]);
    }
    sort($match[1]);
    return $match[1];
}