Home Backend Development Python Tutorial Detailed explanation of python regular expressions

Detailed explanation of python regular expressions

Jun 05, 2019 am 11:03 AM
regular expression

Detailed explanation of python regular expressions

The concept of regular expression

Regular expression is a logical formula for string operations, which uses some predefined Specific characters and combinations of these specific characters form a "rule string". This "rule string" is used to express a filtering logic for strings.

The syntax rules of regular expressions

The rough matching process of regular expressions is:

1. Take out the expressions and Comparison of characters in the text,

2. If every character can be matched, the match is successful; once there are characters that are not matched successfully, the match fails.

3. If there are quantifiers or boundaries in the expression, the process will be slightly different.

The following are some matching rules for regular expressions in Python

##$##. Matches except newline characters##re{ n}##re{ n,}##( ?#...)Comments.(?= re)(?! re) (?> re)##\d Matches any number, equivalent to [0-9].##\D##\A#\B##\n, \t, etc.

## Pattern

##Description

^

# Matches the beginning of the string

## Matches the end of the string.

(\n)# Any characters of ##, when the re.DOTALL tag is specified, can match any characters including newlines. ##[...]

is used to represent a group of characters, listed separately: [amk] matches 'a', 'm' or 'k'

[^...]

Characters not in []: [^abc] matches characters other than a, b, c.

re*

##matched 0 or multiple expressions.

re

Matches 1 or more expressions.

##re?

## Matches 0 or 1 fragments defined by the previous regular expression, non-greedy way

##Exactly matches n previous expressions.

##re{ n, m}

Match n to m times the fragment defined by the previous regular expression, greedy way

##a| b

##matches a or b

(re)

##G matches the expression within the brackets , also represents a group

##(?imx)

Regular expressions contain three optional flags: i, m, or x. Only affects the area in brackets.

##(?-imx)

Regular expression turns off the i, m, or x optional flags. Only affects the area in brackets.

(?: re)

Similar to (...), but does not represent a group

# #(?imx: re)

Use i, m, or x optional flags in parentheses

##(?-imx: re)

Do not use i, m, or x optional flags in parentheses

##Forward positive delimiter. Succeeds if the contained regular expression, represented by ... , successfully matches the current position, otherwise it fails. But once the contained expression has been tried, the matching engine doesn't improve at all; the remainder of the pattern still has to try the right side of the delimiter.

Forward negative delimiter. Opposite of positive delimiter; succeeds when the contained expression cannot be matched at the current position in the string

Matching independent pattern, eliminating backtracking.

##\w

Matches letters, numbers and underscores, equivalent to '[A-Za-z0-9_]'.

##\W

Matches non-alphanumeric characters and underscores, equivalent to '[^A-Za-z0-9_]'.

##\s

Matches any whitespace character, equivalent to [\t\n\r\f].

\S

matches any non-empty character, equivalent to [^ \f\n\r\t \v].

matches any non-number, equivalent to [^0-9].

Matches the beginning of the string

##\Z

# Matches the end of the string. If there is a newline, only the end of the string before the newline is matched. c

##\z

##match string End

\G

Match the position where the last match is completed.

#\b

Matches a word boundary, which refers to the position between a word and a space. For example, 'er\b' matches 'er' in "never" but not in "verb".

## Match non-word boundaries. 'er\B' matches 'er' in "verb", but not in "never".

Matches a newline character. Matches a tab character. Wait

##\1...\9

Matches the content of the nth group.

#\10

Match the content of the nth group if it is matched. Otherwise it refers to the expression of the octal character code.

The above is the detailed content of Detailed explanation of python regular expressions. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

PHP regular expression validation: number format detection PHP regular expression validation: number format detection Mar 21, 2024 am 09:45 AM

PHP regular expression verification: Number format detection When writing PHP programs, it is often necessary to verify the data entered by the user. One of the common verifications is to check whether the data conforms to the specified number format. In PHP, you can use regular expressions to achieve this kind of validation. This article will introduce how to use PHP regular expressions to verify number formats and provide specific code examples. First, let’s look at common number format validation requirements: Integers: only contain numbers 0-9, can start with a plus or minus sign, and do not contain decimal points. floating point

How to validate email address in Golang using regular expression? How to validate email address in Golang using regular expression? May 31, 2024 pm 01:04 PM

To validate email addresses in Golang using regular expressions, follow these steps: Use regexp.MustCompile to create a regular expression pattern that matches valid email address formats. Use the MatchString function to check whether a string matches a pattern. This pattern covers most valid email address formats, including: Local usernames can contain letters, numbers, and special characters: !.#$%&'*+/=?^_{|}~-`Domain names must contain at least One letter, followed by letters, numbers, or hyphens. The top-level domain (TLD) cannot be longer than 63 characters.

PHP regular expressions: exact matching and exclusion of fuzzy inclusions PHP regular expressions: exact matching and exclusion of fuzzy inclusions Feb 28, 2024 pm 01:03 PM

PHP Regular Expressions: Exact Matching and Exclusion Fuzzy inclusion regular expressions are a powerful text matching tool that can help programmers perform efficient search, replacement and filtering when processing text. In PHP, regular expressions are also widely used in string processing and data matching. This article will focus on how to perform exact matching and exclude fuzzy inclusion operations in PHP, and will illustrate it with specific code examples. Exact match Exact match means matching only strings that meet the exact condition, not any variations or extra words.

Master regular expressions and string processing in Go language Master regular expressions and string processing in Go language Nov 30, 2023 am 09:54 AM

As a modern programming language, Go language provides powerful regular expressions and string processing functions, allowing developers to process string data more efficiently. It is very important for developers to master regular expressions and string processing in Go language. This article will introduce in detail the basic concepts and usage of regular expressions in Go language, and how to use Go language to process strings. 1. Regular expressions Regular expressions are a tool used to describe string patterns. They can easily implement operations such as string matching, search, and replacement.

How to match timestamps using regular expressions in Go? How to match timestamps using regular expressions in Go? Jun 02, 2024 am 09:00 AM

In Go, you can use regular expressions to match timestamps: compile a regular expression string, such as the one used to match ISO8601 timestamps: ^\d{4}-\d{2}-\d{2}T \d{2}:\d{2}:\d{2}(\.\d+)?(Z|[+-][0-9]{2}:[0-9]{2})$ . Use the regexp.MatchString function to check if a string matches a regular expression.

How to verify password using regular expression in Go? How to verify password using regular expression in Go? Jun 02, 2024 pm 07:31 PM

The method of using regular expressions to verify passwords in Go is as follows: Define a regular expression pattern that meets the minimum password requirements: at least 8 characters, including lowercase letters, uppercase letters, numbers, and special characters. Compile regular expression patterns using the MustCompile function from the regexp package. Use the MatchString method to test whether the input string matches a regular expression pattern.

How to detect URL with regular expression in Golang? How to detect URL with regular expression in Golang? May 31, 2024 am 10:32 AM

The steps to detect URLs in Golang using regular expressions are as follows: Compile the regular expression pattern using regexp.MustCompile(pattern). Pattern needs to match protocol, hostname, port (optional), path (optional) and query parameters (optional). Use regexp.MatchString(pattern,url) to detect whether the URL matches the pattern.

What are the regular expression wildcards? What are the regular expression wildcards? Nov 17, 2023 pm 01:40 PM

Regular expression wildcards include ".", "*", "+", "?", "^", "$", "[]", "[^]", "[a-z]", "[A-Z] ","[0-9]","\d","\D","\w","\W","\s&quo

See all articles