PHP Regular Expression: How to match all image links in HTML
In HTML pages, we often need to extract image links for use on other occasions, or do some image downloading, batch processing, etc. At this time, PHP regular expressions can help us quickly and accurately match all image links.
1. Analysis of image links in HTML
In HTML, image links usually appear in the form of tags, and their format is as follows:
<img src="image.jpg" alt="图片">
Among them, The src attribute specifies the link address of the image. Generally, the formats of image links are as follows:
- Relative link: /images/picture.jpg
- Absolute link: https://www.example.com/ images/picture.jpg
- Link with parameters: https://www.example.com/images/picture.jpg?size=large
- Relative path link: ../images/picture .jpg
We need to write regular expressions to match these four link formats.
2. PHP regular expression matching image link
There are many kinds of regular expression functions in PHP, among which preg_match() is the most commonly used one and can be used to match from text The specified string. The following is a regular expression that can match the above four image link formats:
$pattern = '/<img.+?src=['"](.+?)['"].*?>/';
This regular expression consists of multiple parts. Let’s explain them one by one:
- < ;img. ?src= matches the
tag and is positioned before the src attribute. Among them, . ? means non-greedy matching of any character until src is encountered.
- ['"] means quotation marks, which can match double quotation marks or single quotation marks.
- (. ?) means matching any character until the next quotation mark is encountered. A capturing group is used here, which can Use the $matches array call in subsequent code.
- .*? means non-greedy matching of any character until the > symbol.
Next, we use the preg_match() function to Extract all image links in HTML:
$html = file_get_contents('example.html'); // 读取 HTML 文件 preg_match_all($pattern, $html, $matches); // 匹配链接 $imgUrls = $matches[1]; // 获取匹配到的链接地址
In this way, we can get an array $imgUrls containing all image links. If you want to only match image links in a certain format, you can do it in a regular expression Some modifications, such as matching only absolute links:
$pattern = '/<img.+?src=['"](https?://.+?)['"].*?>/';
This regular expression increases the restriction of http or https protocol headers and only matches absolute links starting with these two protocols.
Summary
Using PHP regular expressions to match image links in HTML is not a complicated matter. You only need to write the corresponding regular expression according to the link format, and then use the preg_match() function to quickly and accurately extract the All links. If you often need to extract other content from HTML, you can also achieve it through a similar method.
The above is the detailed content of PHP Regular Expression: How to match all image links in HTML. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

To work on file upload we are going to use the form helper. Here, is an example for file upload.

In this chapter, we are going to learn the following topics related to routing ?

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

Validator can be created by adding the following two lines in the controller.
