Google chrome official version code to automatically generate article abstracts [PHP version]

WBOY
Release: 2016-07-29 08:36:51
Original
1079 people have browsed it

Implementation content: Truncate a piece of text containing HTML code, but there will be no problem that the containment tag is not closed.
Note: This is the PHP version, for use on the server side. If you need a client version, please read the next article
When writing programs like BLOG, we often need to display the previous part of the article, but we are afraid Improper truncation destroys the enclosing tag to cause damage to the entire document structure, using my function can solve this problem in a non-demanding situation.
Everyone should consider whether this function is applied on the server side or on the client side. Because I consider that this function may be more machine-intensive to run, it can be placed on the client if security requirements are not high.
It is best to put this summary in a separate field in the data table, so that the corresponding database query is also optimized. It's still a good deal to sacrifice a little space in exchange for a lot of time.
Let’s talk about security issues again, mainly content security. If the client intends to change the normal summary information, generally the owner of the BLOG has this power, and it is his own business to destroy the consistency between the summary and the original text. Security other than content can be solved on the server side. Therefore, it is recommended to use this function on the client side.
Core code

Copy code The code is as follows:


// PHP 4.3 or above needed
define("BRIEF_LENGTH", 800); //Word amount of the Briefing of an Article
function Generate_Brief($text ){
global $Briefing_Length;
if(strlen($text) <= BRIEF_LENGTH ) return $text;
$Foremost = substr($text, 0, BRIEF_LENGTH);
$re = "/<(/?) (P|DIV|H1|H2|H3|H4|H5|H6|ADDRESS|PRE|TABLE|TR|TD|TH|INPUT|SELECT|TEXTAREA|OBJECT|A|UL|OL|LI|BASE|META|LINK |HR|BR|PARAM|IMG|AREA|INPUT|SPAN)[^>]*(>?)/i";
$Single = "/BASE|META|LINK|HR|BR|PARAM|IMG| AREA|INPUT/i";
$Stack = array(); $posStack = array();
preg_match_all($re,$Foremost,$matches, PREG_SET_ORDER | PREG_OFFSET_CAPTURE);
/* [Child-matching Specification]:
$matches[$i][1] : A "/" character indicating whether current "<...>" Friction is Closing Part
$matches[$i][2] : Element Name.
$matches[$ i][3] : Right > of a "<...>" Friction */
for($i = 0; $i < count($matches); $i++){
if($matches [$i][1][0] == ""){
$Elem = $matches[$i][2][0];
if(preg_match($Single,$Elem) && $matches[$i ][3][0] !=""){
                                                                                                                                                                                                                                                           array_push(​ $i][2][1]);

if($matches[$i][3][0] =="") break;
}else{
$StackTop = $Stack[count($Stack)- 1];
$End = strtoupper($matches[$i][2][0]);
if(strcasecmp($StackTop,$End)==0){
array_pop($Stack);
array_pop($ posStack);                                                                                                                               ​​
​}​
​ $cutpos = array_shift( $posStack) - 1;
$Foremost = substr($Foremost,0,$cutpos);
return $Foremost;
};

If you encounter problems (you find that the above function does not support multi-byte character sets well), don't bother trying the following!

Copy the code The code is as follows:


function Generate_Brief($text){
global $Briefing_Length;
mb_regex_encoding("UTF-8");
if(mb_strlen($text) < = BRIEF_LENGTH ) return $text;
$Foremost = mb_substr($text, 0, BRIEF_LENGTH);
$re = "<(/?)(P|DIV|H1|H2|H3|H4|H5|H6|ADDRESS|PRE|TABLE |TR|TD|TH|INPUT|SELECT|TEXTAREA|OBJECT|A|UL|OL|LI|BASE|META|LINK|HR|BR|PARAM|IMG|AREA|INPUT|SPAN)[^>]*( >?)";
$Single = "/BASE|META|LINK|HR|BR|PARAM|IMG|AREA|INPUT|BR/i";
$Stack = array(); $posStack = array();
mb_ereg_search_init($Foremost, $re, 'i');
while($pos = mb_ereg_search_pos()){
$match = mb_ereg_search_getregs();
/* [Child-matching Formulation]:
$matche[1] : A "/" "" Characting WHETHER CURRENT "& LT; ... & GT;" FricTion Is Closing PART
$ Matche [2]: Element name. GT ;" Friction
*/
if($match[1]==""){
$Elem = $match[2];
if(mb_eregi($Single, $Elem) && $match[3] !=" "){
                continue; }else{
                                                                       -1]; e $ END = MB_STRTOUPPER ($ match [2]);
If (STRCASECMP, $ END) == 0) {
Array_pop ($ Stack);

if (



$match[3] ==""){
                                                                   - 1;
$Foremost = mb_substr($ Foremost,0,$cutpos,"UTF-8");
return $Foremost;
};


The above introduces the code for automatically generating article summaries for the official version of Google Chrome [PHP version], including the official version of Google Chrome. I hope it will be helpful to friends who are interested in PHP tutorials.


Related labels:
source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template