PHP Regular Expression: How to match all image links in HTML
In HTML pages, we often need to extract image links for use on other occasions, or do some image downloading, batch processing, etc. At this time, PHP regular expressions can help us quickly and accurately match all image links.
1. Analysis of image links in HTML
In HTML, image links usually appear in the form of tags, and their format is as follows:
<img src="image.jpg" alt="图片">
Among them, The src attribute specifies the link address of the image. Generally, the formats of image links are as follows:
- Relative link: /images/picture.jpg
- Absolute link: https://www.example.com/ images/picture.jpg
- Link with parameters: https://www.example.com/images/picture.jpg?size=large
- Relative path link: ../images/picture .jpg
We need to write regular expressions to match these four link formats.
2. PHP regular expression matching image link
There are many kinds of regular expression functions in PHP, among which preg_match() is the most commonly used one and can be used to match from text The specified string. The following is a regular expression that can match the above four image link formats:
$pattern = '/<img.+?src=['"](.+?)['"].*?>/';
This regular expression consists of multiple parts. Let’s explain them one by one:
- < ;img. ?src= matches the
tag and is positioned before the src attribute. Among them, . ? means non-greedy matching of any character until src is encountered.
- ['"] means quotation marks, which can match double quotation marks or single quotation marks.
- (. ?) means matching any character until the next quotation mark is encountered. A capturing group is used here, which can Use the $matches array call in subsequent code.
- .*? means non-greedy matching of any character until the > symbol.
Next, we use the preg_match() function to Extract all image links in HTML:
$html = file_get_contents('example.html'); // 读取 HTML 文件 preg_match_all($pattern, $html, $matches); // 匹配链接 $imgUrls = $matches[1]; // 获取匹配到的链接地址
In this way, we can get an array $imgUrls containing all image links. If you want to only match image links in a certain format, you can do it in a regular expression Some modifications, such as matching only absolute links:
$pattern = '/<img.+?src=['"](https?://.+?)['"].*?>/';
This regular expression increases the restriction of http or https protocol headers and only matches absolute links starting with these two protocols.
Summary
Using PHP regular expressions to match image links in HTML is not a complicated matter. You only need to write the corresponding regular expression according to the link format, and then use the preg_match() function to quickly and accurately extract the All links. If you often need to extract other content from HTML, you can also achieve it through a similar method.
The above is the detailed content of PHP Regular Expression: How to match all image links in HTML. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

If you are an experienced PHP developer, you might have the feeling that you’ve been there and done that already.You have developed a significant number of applications, debugged millions of lines of code, and tweaked a bunch of scripts to achieve op

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

JWT is an open standard based on JSON, used to securely transmit information between parties, mainly for identity authentication and information exchange. 1. JWT consists of three parts: Header, Payload and Signature. 2. The working principle of JWT includes three steps: generating JWT, verifying JWT and parsing Payload. 3. When using JWT for authentication in PHP, JWT can be generated and verified, and user role and permission information can be included in advanced usage. 4. Common errors include signature verification failure, token expiration, and payload oversized. Debugging skills include using debugging tools and logging. 5. Performance optimization and best practices include using appropriate signature algorithms, setting validity periods reasonably,

This tutorial demonstrates how to efficiently process XML documents using PHP. XML (eXtensible Markup Language) is a versatile text-based markup language designed for both human readability and machine parsing. It's commonly used for data storage an

A string is a sequence of characters, including letters, numbers, and symbols. This tutorial will learn how to calculate the number of vowels in a given string in PHP using different methods. The vowels in English are a, e, i, o, u, and they can be uppercase or lowercase. What is a vowel? Vowels are alphabetic characters that represent a specific pronunciation. There are five vowels in English, including uppercase and lowercase: a, e, i, o, u Example 1 Input: String = "Tutorialspoint" Output: 6 explain The vowels in the string "Tutorialspoint" are u, o, i, a, o, i. There are 6 yuan in total

Static binding (static::) implements late static binding (LSB) in PHP, allowing calling classes to be referenced in static contexts rather than defining classes. 1) The parsing process is performed at runtime, 2) Look up the call class in the inheritance relationship, 3) It may bring performance overhead.

What are the magic methods of PHP? PHP's magic methods include: 1.\_\_construct, used to initialize objects; 2.\_\_destruct, used to clean up resources; 3.\_\_call, handle non-existent method calls; 4.\_\_get, implement dynamic attribute access; 5.\_\_set, implement dynamic attribute settings. These methods are automatically called in certain situations, improving code flexibility and efficiency.
