Home > Backend Development > PHP Tutorial > Example of parsing and processing HTML/XML for content filtering using PHP

Example of parsing and processing HTML/XML for content filtering using PHP

WBOY
Release: 2023-09-12 10:24:01
Original
758 people have browsed it

Example of parsing and processing HTML/XML for content filtering using PHP

Example of parsing and processing HTML/XML for content filtering using PHP

Introduction:
In web development, we often need to extract data from HTML or XML files Extract specific content from, or filter and process the content. As a powerful server-side scripting language, PHP has many built-in functions and classes for processing HTML/XML, allowing us to easily parse and process HTML/XML files. This article will show you an example of how to parse and process HTML/XML for content filtering using PHP.

1. HTML/XML parsing
In PHP, we can use some built-in functions and classes to parse HTML/XML files, such as file_get_contents()function,SimpleXMLElement classes etc.

Example 1: Parsing HTML
We assume that there is an HTML file named example.html with the following content:

<html>
<body>
  <h1>欢迎来到我的网站</h1>
  <p>这是一个演示HTML解析的示例页面。</p>
  <ul>
    <li>列表项1</li>
    <li>列表项2</li>
    <li>列表项3</li>
  </ul>
</body>
</html>
Copy after login

Now, we want to extract from this HTML Extract all <li> tags under the <h1> tag and the <ul> tag from the file.

<?php
$html = file_get_contents('example.html');

$dom = new DOMDocument();
$dom->loadHTML($html);

$h1 = $dom->getElementsByTagName('h1')->item(0)->nodeValue; // 提取<h1>标签内容

$liList = $dom->getElementsByTagName('li');
foreach ($liList as $li) {
    echo $li->nodeValue . "<br>"; // 遍历输出所有<li>标签内容
}
?>
Copy after login

Running the above PHP code, we can get the following output:

欢迎来到我的网站
列表项1
列表项2
列表项3
Copy after login

Example 2: Parsing XML
Suppose there is an XML file named example.xml , the content is as follows:

<books>
  <book>
    <title>PHP教程</title>
    <author>张三</author>
  </book>
  <book>
    <title>JavaScript教程</title>
    <author>李四</author>
  </book>
  <book>
    <title>Python教程</title>
    <author>王五</author>
  </book>
</books>
Copy after login

Now, we want to extract all the book titles and authors from this XML file.

<?php
$xml = file_get_contents('example.xml');

$dom = new SimpleXMLElement($xml);

foreach ($dom->book as $book) {
    $title = $book->title;
    $author = $book->author;
    
    echo "书名:$title,作者:$author<br>";
}
?>
Copy after login

Running the above PHP code, we can get the following output:

书名:PHP教程,作者:张三
书名:JavaScript教程,作者:李四
书名:Python教程,作者:王五
Copy after login

2. Content filtering
In addition to parsing HTML/XML files, we can also use PHP for content filtering. This is often used in web development to filter user-submitted data to prevent potential security risks.

Example 3: Filtering HTML tags and special characters
Suppose we have a user-submitted text content that contains HTML tags and special characters, and we want to delete or escape these tags and characters.

<?php
$input = "<p><strong>欢迎访问我们的网站!</strong></p>";
$output = strip_tags($input); // 过滤HTML标签
$output = htmlspecialchars($output); // 转义特殊字符

echo $output;
?>
Copy after login

Run the above PHP code, we can get the following output:

欢迎访问我们的网站!
Copy after login

3. Summary
Through the above examples, we learned how to use PHP to parse and process HTML/XML, and how to Perform content filtering. These techniques are very practical in web development and can help us quickly extract and process specific content, as well as ensure the security of user input.

I hope this article will help you understand PHP parsing and processing HTML/XML for content filtering! If you have any questions or suggestions, please feel free to contact us. Thanks for reading!

The above is the detailed content of Example of parsing and processing HTML/XML for content filtering using PHP. For more information, please follow other related articles on the PHP Chinese website!

source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template