


PHP HTML code string interception implementation code_PHP tutorial
而且给的数据是HTML代码串,比如这样:
When intercepting, the content inside the div tag must be intercepted, and the HTML tag must be retained, and only the text in it must be processed. For example, I may just intercept the word "李" in "李思", but if I put it on the front end like this, the a tag in front of "李思" will not be closed, so after intercepting, I need to ensure that the HTML syntax is correct.
This problem is really not easy to solve, and it made me depressed for two days. Please note that this is just a string, but the content is HTML code and there is no DOM. It would be easier if it were processed on the front end. You can directly obtain the DOM, then process the nodes inside, and finally output things like innerHTML. It doesn't work now, I have to change my mind. My colleague’s thinking is this:
Loop through each character of the string. Set a tag, and set it to 1 when it encounters the tag < at the beginning of the tag. The following characters will not be counted, and then start counting after encountering >. When processing the string inside the label, you must first determine whether the current character encoding may be Chinese. Generally speaking, the length of UTF-8 encoded Chinese characters in PHP is 3, so if you encounter a Chinese character encoding , I have to skip two uncounted ones... At this point, my head is starting to get big. Personally, I think this method is very unpleasant. First of all, this kind of exquisite logic is not easy to control, and The length of Chinese generated under UFT-8 encoding may be 3 or 4 so the rigor of the code is questionable. .
My personal idea is to use Tidy (please see the PHP manual for specific usage). I studied Tidy yesterday and found that this thing is quite useful. First, convert this string into a Tidy object, like this:
$tidy = tidy_parse_string($str, array(), 'utf8′); // The last one is to set the encoding. Note that here is utf8, not utf-8, there is no connecting line in the middle.
Then get the body in $tidy (because $tidy will automatically add tags such as
after conversion):$body = tidy_get_body($tidy);
At this time, you can use var_dump to look at some $body structures, and you will find that it turns each tag into a corresponding object with corresponding attributes. For example, such as sdf , some attributes corresponding to such a statement are:
name=>”a”
value => “sdf”
child=> array{[0]=>a Text node object, value is sdf}
attribute=array{”href”=>”#”}
…..Other attributes
As you can see, we can actually process the value of the text node below the node corresponding to the a tag separately, so that it will not destroy any HTML integrity. Originally, I thought that after changing the value of the text node in the a tag, the value of the a tag would also change accordingly. In that case, it would be OK if I directly returned the value of the node corresponding to the a tag. I didn't expect it to be like that. Alas, so I processed the text in it. Then you still have to spell out the new HTML yourself.
After knowing the structure of the Tidy object, everything is easy to handle. Just traverse all the nodes. For this requirement, it is to find the div tag and then start processing the nodes inside. The code is as follows:
if(mb_strwidth($subchild->value, 'utf-8′) >= $len)
{
$subchild->value = mb_strimwidth($subchild->value, 0 , $len, '…', 'utf-8′);
$trimed_str .= $subchild->value;
break;
}
else
{
$ trimed_str .= $subchild->value;
$len = $len - mb_strwidth($subchild->value, 'utf-8′);
}
The $subchild inside is a child node. Note that mb_strwidth is used here to get the string length. I highly recommend this mb_strwidth, it’s very useful. It will treat Chinese as two characters in length, which exactly meets the needs here! Moreover, mb_strimwidth is used when intercepting strings. This function will also treat Chinese as two characters in length. Functions starting with mb_ are really easy to use.
I won’t write out the specific code, because it is written based on a requirement and is not made into a universal form. One day I have time to make it universal and publish it.
In addition, it is a pity that FireFox does not support the text-overflow attribute, otherwise there would be no need to work so hard in the background to truncate it. If you have a better method, please feel free to suggest it! Any help is greatly appreciated.

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

To work on file upload we are going to use the form helper. Here, is an example for file upload.

In this chapter, we are going to learn the following topics related to routing ?

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

Validator can be created by adding the following two lines in the controller.

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c
