Home Backend Development PHP Tutorial Regular expressions in PHP

Regular expressions in PHP

Nov 30, 2016 am 11:13 AM
Apple php

PHP inherits the consistent tradition of *NIX and fully supports the processing of regular expressions. Regular expressions provide an advanced, but non-intuitive, method of string matching and processing. Friends who have used PERL's regular expressions know that regular expressions are very powerful, but they are not easy to learn.
For example:

^.+@.+..+$

This effective but incomprehensible code is enough to give some programmers a headache (me) or make them give up using regular expressions. I believe that after reading this tutorial, you will be able to understand the meaning of this code.

Basic Pattern Matching

Everything starts from the basics. Patterns are the most basic elements of regular expressions. They are a set of characters that describe the characteristics of a string. Patterns can be simple, consisting of ordinary strings, or very complex, often using special characters to represent a range of characters, recurrences, or to represent context. For example:

^once

This pattern contains a special character ^, which means that the pattern only matches those strings starting with once. For example, this pattern matches the string "once upon a time" but does not match "There once was a man from NewYork". Just like the ^ symbol indicates the beginning, the $ symbol matches strings that end with a given pattern.

bucket$

This pattern matches "Who kept all of this cash in a bucket" but does not match "buckets". When the characters ^ and $ are used together, they represent an exact match (strings are the same as patterns). For example:

^bucket$

only matches the string "bucket". If a pattern does not include ^ and $, then it matches any string that contains the pattern. For example: the pattern

once

matches the string

There once was a man from NewYork
Who kept all of his cash in a bucket.

.

The letters (o-n-c-e) in this pattern are literal characters, that is, they represent the letters themselves, and the same goes for numbers. Other slightly more complex characters, such as punctuation and white characters (spaces, tabs, etc.), require escape sequences. All escape sequences begin with a backslash (). The escape sequence for the tab character is: t. So if we want to detect whether a string starts with a tab character, we can use this pattern:

^t

Similarly, use n to represent "new line" and r to represent carriage return. Other special symbols can be used with a backslash in front. For example, the backslash itself is represented by ., the period is represented by ., and so on.

Character cluster

In INTERNET programs, regular expressions are usually used to verify user input. When a user submits a FORM, it is not enough to use ordinary literal characters to determine whether the entered phone number, address, email address, credit card number, etc. are valid.

So we need to use a more free way to describe the pattern we want, which is character clusters. To create a cluster that represents all vowel characters, place all vowel characters in square brackets:

[AaEeIiOoUu]

This pattern matches any vowel character, but can only represent one character. Use hyphens to represent a range of characters, such as:

[a-z] // Match all lowercase letters

[A-Z] // Match all uppercase letters

[a-zA-Z] // Match all Letters

[0-9] //Match all numbers

[0-9.-] //Match all numbers, periods and minus signs

[frtn] //Match all white characters

Same , these also represent only one character, which is a very important one. If you want to match a string consisting of a lowercase letter and a digit, such as "z2", "t6" or "g7", but not "ab2", "r2d3" or "b52", use this pattern:

^[a-z][0-9]$

Although [a-z] represents a range of 26 letters, here it can only match strings where the first character is a lowercase letter.

It was mentioned earlier that ^ represents the beginning of a string, but it also has another meaning. When ^ is used within a set of square brackets, it means "not" or "exclude" and is often used to eliminate a certain character. Using the previous example, we require that the first character cannot be a number:

^[^0-9][0-9]$

This pattern matches "&5", "g7" and "-2" , but it does not match "12" and "66". Here are a few examples of excluding specific characters:

[^a-z] //All characters except lowercase letters

[^/^] //All characters except "/" and "^" characters

[^"'] //All characters except double quotes (") and single quotes (')


The special characters "." (dot, period) are used in regular expressions to represent all characters except "new line". So the pattern "^.5$" matches any two-character string that ends with the number 5 and begins with some other non-"newline" character. The pattern "." can match any string, except empty strings and strings containing only a "new line".

PHP’s regular expressions have some built-in universal character clusters, the list is as follows:

Character cluster Meaning

[[:alpha:]] Any letter

[[:digit:]] Any number

[[: alnum:]] Any letters and numbers

[[:space:]] Any white characters

[[:upper:]] Any uppercase letters

[[:lower:]] Any lowercase letters

[[:punct :]] Any punctuation mark

[[:xdigit:]] Any hexadecimal number, equivalent to [0-9a-fA-F]

Determine repeated occurrences

By now, you already know how to match A letter or number, but more often, it may be a word or a group of numbers. A word consists of several letters, and a group of numbers consists of several singular numbers. The curly braces ({}) following a character or character cluster are used to determine the number of times the preceding content is repeated.

Character cluster Meaning

^[a-zA-Z_]$ All letters and underscores

^[[:alpha:]]{3}$ All 3-letter words

^a$ Letter a

^a{4}$ aaaa

^a{2,4}$ aa,aaa or aaaa

^a{1,3}$ a,aa or aaa

^a{2,}$ Contains more than A string of two a's

^a{2,} For example: aardvark and aaab, but apple cannot

.{2} All two characters

These examples describe three different uses of curly braces. A number, {x} means "the preceding character or character cluster appears only x times"; a number plus a comma, {x,} means "the preceding content appears x or more times"; two Comma-separated numbers, {x,y} means "the previous content appears at least x times, but not more than y times". We can extend the pattern to more words or numbers:

^[a-zA-Z0-9_]{1,}$ //All strings containing more than one letter, number or underscore

^[0 -9]{1,}$ //All positive numbers

^-{0,1}[0-9]{1,}$ //All integers

^-{0,1}[0- 9]{0,}.{0,1}[0-9]{0,}$ //All decimals

The last example is not easy to understand, is it? Look at it this way: with everything starting with an optional negative sign (-{0,1}) (^), followed by 0 or more digits ([0-9]{0,}), and an optional A decimal point (.{0,1}) followed by 0 or more digits ([0-9]{0,}) and nothing else ($). Below you will learn about the simpler methods you can use.

The special characters "?" are equal to {0,1}, they both represent: "0 or 1 previous content" or "the previous content is optional". So the example just now can be simplified to:

^-?[0-9]{0,}.?[0-9]{0,}$

The special character "*" is equal to {0,}, They all represent "zero or more previous content". Finally, the character "+" is equal to {1,}, which means "1 or more previous contents", so the above 4 examples can be written as:

^[a-zA-Z0-9_]+$ //All strings containing more than one letter, number or underscore

^[0-9]+$ //All positive numbers

^-?[0-9]+$ //All integers

^-?[0-9]*.?[0-9]*$ //All decimals

Of course this doesn't technically reduce the complexity of regular expressions, but it makes them easier to read.

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

CakePHP Project Configuration CakePHP Project Configuration Sep 10, 2024 pm 05:25 PM

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian Dec 24, 2024 pm 04:42 PM

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

CakePHP Date and Time CakePHP Date and Time Sep 10, 2024 pm 05:27 PM

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

CakePHP File upload CakePHP File upload Sep 10, 2024 pm 05:27 PM

To work on file upload we are going to use the form helper. Here, is an example for file upload.

CakePHP Routing CakePHP Routing Sep 10, 2024 pm 05:25 PM

In this chapter, we are going to learn the following topics related to routing ?

Discuss CakePHP Discuss CakePHP Sep 10, 2024 pm 05:28 PM

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

CakePHP Creating Validators CakePHP Creating Validators Sep 10, 2024 pm 05:26 PM

Validator can be created by adding the following two lines in the controller.

How To Set Up Visual Studio Code (VS Code) for PHP Development How To Set Up Visual Studio Code (VS Code) for PHP Development Dec 20, 2024 am 11:31 AM

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

See all articles