Home Backend Development PHP Tutorial PHP Regular Expressions: How to match all headings and paragraphs in HTML

PHP Regular Expressions: How to match all headings and paragraphs in HTML

Jun 22, 2023 pm 07:11 PM
php regular expression html title html paragraph

<p>HTML tags are often used in modern websites, and these tags contain various elements, such as titles, paragraphs, etc. If you are a PHP developer, then you may have encountered the need to extract all headings and paragraph tags from an HTML file for further use. This is where regular expressions come in handy. This article will show you how to use PHP regular expressions to match all headings and paragraphs in HTML. </p> <p>First of all, we need to know what the tags of titles and paragraphs are in HTML. HTML tags contain a variety of title and paragraph elements, the most common of which are h1, h2, h3 and other tags used to represent titles, while p tags are used to represent paragraphs. In this article, we will only focus on these most commonly used tags. </p> <p>Now, let’s see how to use PHP regular expressions to match title and paragraph tags in HTML. The code below shows a simple PHP script that will read an HTML file and match all headings and paragraphs in it using regular expressions: </p><div class="code" style="position:relative; padding:0px; margin:0px;"><pre class='brush:php;toolbar:false;'><?php // 读入 HTML 文件 $html = file_get_contents('example.html'); // 正则表达式匹配所有标题和段落 $pattern = '/<(hd|p)[^>]*>(.*?)</>/si'; preg_match_all($pattern, $html, $matches); // 显示所匹配的结果 print_r($matches[0]); ?></pre><div class="contentsignin">Copy after login</div></div><p>Regular expressions in this code snippet<code>/<(hd|p)[^>]*>(.*?)</ >/si</code> You can do the following things: </p><ul><li> The <code><</code> and <code>></code> metacharacters are used to match the beginning and end of HTML tags. </li><li><code>hd|p</code> means match all heading (h1, h2, h3, etc.) and paragraph (p) tags. </li><li><code>[^>]*</code> matches all characters in the tag except <code>></code>, because <code>></code> is the starting point of the tag. delimiter of the starting position. </li><li><code>(.*?)</code> Use non-greedy matching to match text in the middle of tags. </li><li><code></ ></code> means matching the end tag corresponding to the start tag. <code> </code> means matching the previously specified tag (i.e. <code>hd|p</code>). </li></ul><p>In this regular expression, we use the two pattern modifiers <code>s</code> and <code>i</code>. Among them, <code>s</code> is used to turn on the "dot matching mode" so that the <code>.</code> metacharacter matches all characters, including newlines. And <code>i</code> is used to turn on the "case-insensitive mode" so that the case of the tag name does not affect the matching results. </p><p>When the script finishes running, it will print out all matching headings and paragraph tags. The result will look something like this: </p><div class="code" style="position:relative; padding:0px; margin:0px;"><pre class='brush:php;toolbar:false;'>Array ( [0] => <h1>PHP 正则表达式</h1> [1] => <p>现代网站中常常会用到 HTML 标记,这些标记包含了各种元素,例如标题和段落等等。</p> [2] => <h2>标题2</h2> [3] => <p>段落2</p> [4] => <h3>标题3</h3> [5] => <p>段落3</p> )</pre><div class="contentsignin">Copy after login</div></div><p>With this result, we can see that the PHP regular expression successfully matched all heading and paragraph tags in the HTML. This regular expression has other application scenarios, such as matching links, pictures and tables in HTML. Hopefully this article helped you better understand using PHP regular expressions to match elements in HTML. </p>

The above is the detailed content of PHP Regular Expressions: How to match all headings and paragraphs in HTML. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

How to verify if input is an IPv6 address using PHP regex How to verify if input is an IPv6 address using PHP regex Jun 25, 2023 am 09:37 AM

IPv6 refers to InternetProtocolVersion6, which is an IP address protocol used for Internet communication. An IPv6 address is a number composed of 128 bits, usually represented by eight hexadecimal number groups. In PHP, you can use regular expressions to verify whether the input is an IPv6 address. Here's how to use PHP regular expressions to verify IPv6 addresses. Step 1: Understand the format of the IPv6 address. The IPv6 address consists of 8 hexadecimal blocks, each

How to verify if string is empty with PHP regular expression How to verify if string is empty with PHP regular expression Jun 24, 2023 am 08:46 AM

In PHP, we can use regular expressions to verify whether a string is empty. Cases where the string is empty include the following: The string contains only spaces. The string length is 0. String is null or undefined. Next, we'll cover how to use regular expressions in PHP to validate these situations. Regular expression: s+ This regular expression can be used to match strings containing only spaces. Where s means matching spaces, + means matching one or more. Code example: functionisEmptySt

How to validate phone number format with PHP regular expression How to validate phone number format with PHP regular expression Jun 24, 2023 am 08:44 AM

When writing web applications, you often need to verify phone numbers. A common method in PHP is to use regular expressions to determine whether the phone number is in the correct format. Regular expressions are a powerful tool that can help you identify certain patterns in concise statements. Below is an example of using regular expressions in PHP to validate phone number format. First, let's define the common format for phone numbers. Phone numbers can contain numbers, parentheses, hyphens, and spaces. A standard phone number should contain 10 digits, preceded by

PHP regular expression to verify whether the input string is in the format of ID number or passport number PHP regular expression to verify whether the input string is in the format of ID number or passport number Jun 24, 2023 pm 12:11 PM

ID number and passport number are common document numbers in people's lives. When implementing functions involving these document numbers, it is often necessary to perform format verification on the entered numbers to ensure their correctness. In PHP, regular expressions can be used to achieve this function. This article will introduce how to use PHP regular expressions to verify whether the input string is in the format of an ID number or passport number. 1. ID card number verification The ID card number is composed of 18 digits and the last digit may be a letter (check code). Its format is as follows: the first 6

How to verify URL address format with PHP regular expression How to verify URL address format with PHP regular expression Jun 24, 2023 am 09:51 AM

With the rapid development of the Internet, URL addresses have become an indispensable part of people's daily lives. In web development, in order to ensure that the URL address entered by the user can be correctly recognized and used by the system, we need to perform format verification on it. This article will introduce how to use PHP regular expressions to verify URL address format. 1. Basic components of URL addresses Before understanding how to verify the URL address format, we first need to understand the basic components of URL addresses. Usually, a standard URL address consists of

How to verify if it is a file path using regular expression in PHP How to verify if it is a file path using regular expression in PHP Jun 24, 2023 am 10:18 AM

In PHP, regular expressions are a commonly used string matching and validation tool. During the development process, the input file path needs to be frequently verified to ensure that it is in the correct format. This article will introduce how to use regular expressions to verify whether a string is a file path. First, we need to determine the basic format of a file path. In Windows systems, a typical file path is in a format similar to "C:ProgramFilesPHPphp.exe". The path is divided into the following parts:

How to validate input of specific length using PHP regex How to validate input of specific length using PHP regex Jun 24, 2023 am 10:17 AM

When developing web applications, it is often necessary to verify that user input conforms to specific format and length requirements. PHP regular expressions provide a powerful method for validation. This article will introduce how to use PHP regular expressions to validate input of a specific length. Determine the input length requirement Before you start writing a regular expression, you need to determine the input length requirement. For example, if the user is asked to enter a password of length 8, then the regular expression should match 8 characters instead of matching a string of 8 characters or more. Write regex

How to verify if input is an IPv4 address using regex in PHP How to verify if input is an IPv4 address using regex in PHP Jun 24, 2023 am 09:20 AM

PHP, as a popular server-side programming language, provides some powerful tools to verify the correctness of input data. In this article, we will focus on how to use regular expressions to verify whether the input is an IPv4 address. First, what is an IPv4 address? An IPv4 address refers to a 32-bit binary number, which is usually divided into four 8-bit binary numbers, separated by ".", and expressed in decimal form. For example, 127.0.0.1 is an IPv4 address. Now, let's see how to use regular expressions to

See all articles