Detailed explanation of the use of regular metacharacters
This time I will bring you a detailed explanation of the use of regular metacharacters. What are the precautions when using regular metacharacters? The following is a practical case, let’s take a look.
Note: In all examples, the regular expression matching result is contained between [ and ] in the source text, Some examples will be implemented using Java. If it is the usage of regular expressions in Java itself, it will be explained in the corresponding place. All java examples are tested under JDK1.6.0_13.
1. Escape special characters
Metacharacters are characters that have special meanings in regular expressions. Because metacharacters have special meanings in regular expressions, these characters cannot be used to represent themselves. You can escape a metacharacter by preceding it with a backslash, so that the resulting escape sequence will match that character itself rather than its special metacharacter meaning. For example, if you want to match [and], you must escape it:
and
.
To escape metacharacters, you need to use the slash \ character, which means that the \ character itself is also a metacharacter. To match the \ character itself, it must be escaped into \\. Such as matching windows file path.
2. Match white space characters
Metacharacters can be roughly divided into two types: one is used to match text (such as .), and the other is regular The expression's syntax requires it (such as [and]).
When performing regular expression searches, we often encounter situations where we need to match non-printing whitespace characters in the original text. For example, we may need to find all tab characters, or we need to find newline characters. Such characters are difficult to be directly input into a regular expression. In this case, we can use the special elements listed below. characters to enter them:
\b | Go back (and delete) one character (Backspace key) |
\f | Form feed character |
\n | Line feed character |
\r | Carriage return character |
\t | Tab character (Tab key) |
\v | Vertical Tab |
Let’s look at an example to remove blank lines from the file:
Text:
8 5 4 1 6 3 2 7 9
7 6 2 9 5 8 3 4 1
9 3 1 4 2 7 8 5 6
6 9 3 8 7 5 1 2 4
5 1 8 3 4 2 6 9 7
2 4 7 6 1 9 5 3 8
3 26 7 8 4 9 1 5
4 8 9 5 3 1 7 6 2
1 7 5 2 9 6 4 8 3
Regular expression: \r\n\r\n
Analysis: \r\n matches a carriage return + line feed combination, it is used as the end tag of a text line in the Windows operating system. A search using the regular expression \r\n\r\n will match two consecutive end-of-line tags, which happen to be blank lines.
Note: Unix and Linux operating systems only use a newline character to end a text line. In other words, to match blank lines in Unix or Linux systems, just use \n\n. No need to add \r. Regular expressions applicable to both windows and Unix/Linux should include an optional \r and a must-match \n, that is, \r?\n\r?\n, which will be discussed in a later article .
The Java code is as follows:
public static void matchBlankLine() throws Exception{ BufferedReader br = new BufferedReader(new FileReader(new File("E:/九宫格.txt"))); StringBuilder sb = new StringBuilder(); char[] cbuf = new char[1024]; int len = 0; while(br.ready() && (len = br.read(cbuf)) > 0){ br.read(cbuf); sb.append(cbuf, 0, len); } String reg = "\r\n\r\n"; System.out.println("原内容:\n" + sb.toString()); System.out.println("处理后:-----------------------------"); System.out.println(sb.toString().replaceAll(reg, "\r\n")); }
The running result is as follows:
原内容: 8 5 4 1 6 3 2 7 9 7 6 2 9 5 8 3 4 1 9 3 1 4 2 7 8 5 6 6 9 3 8 7 5 1 2 4 5 1 8 3 4 2 6 9 7 2 4 7 6 1 9 5 3 8 3 2 6 7 8 4 9 1 5 4 8 9 5 3 1 7 6 2 1 7 5 2 9 6 4 8 3 处理后:----------------------------- 8 5 4 1 6 3 2 7 9 7 6 2 9 5 8 3 4 1 9 3 1 4 2 7 8 5 6 6 9 3 8 7 5 1 2 4 5 1 8 3 4 2 6 9 7 2 4 7 6 1 9 5 3 8 3 2 6 7 8 4 9 1 5 4 8 9 5 3 1 7 6 2 1 7 5 2 9 6 4 8 3
3. Match specific character categories
Character sets (matching one of multiple characters) are the most common form of matching, and some commonly used character sets can be replaced by special metacharacters. These metacharacters match a certain class of characters (class metacharacters). Class metacharacters are not essential because you can match a certain class of characters by enumerating the relevant characters one by one or by defining a character range, but using them The constructed regular expression is concise and easy to understand and is commonly used in practical applications.
1. Match numbers and non-numbers
\d Any number, equivalent to any one of [0-9] or [0123456789]
\D Non-digits, equivalent to [^0-9] or [^0123456789]
2. Match letters and numbers with non-letters and numbers
letters (A-Z is not Case-sensitive), numbers, and underscores are a commonly used set of characters. The following metacharacters can be used:
\w Any letter (case-insensitive), numbers, and underscores are equivalent to [0- 9a-zA-Z_]
\W Any non-alphanumeric and underscore, equivalent to [^0-9a-zA-Z_]
3. Matches whitespace characters and non-whitespace characters
\s Any white space character is equivalent to [\f\n\r\t\v]
\S Any white space character is equivalent to [^\f\n \r\t\v]
Note: The backspace metacharacter \b is not within the range of \s.
4. Match hexadecimal or octal values
Hexadecimal: given with the prefix \x, for example: \x0A corresponds to the ASCII character 10 (newline character), its effect is equivalent to \n.
Octal: given with the prefix \0, the value itself can be two or three digits, for example: \011 corresponds to ASCII character 9 (tab), and its effect is equivalent to \t.
4. Use POSIX character classes
POSIX character classes are a shorthand form supported by many regular expression implementations. Java also supports it, but JavaScript does not. POSIX characters are as follows:
[:alnum:] | Any letter or number, equivalent to [a-zA-Z0-9] |
[:alpha:] | Any letter is equivalent to [a-zA-Z] |
Space or tab character, equivalent to [\t] | |
ASCII control character ( ASCII 0 to 31, plus ASCII 127) | |
Any number, equivalent to [0-9] | |
Any printable character, but not including spaces | |
Any lowercase letter, equivalent to [a-z] | |
Any printable character | |
Any character that does not belong to [:alnum:] and [:cntrl:] | |
Any whitespace character, including spaces, is equivalent to [^\f\n\r\t\v] | |
Any uppercase letter is equivalent to [A-Z] | |
Any hexadecimal digit is equivalent to [a- fA-F0-9] |
\p{Alnum} | Alphanumeric characters: [\p{Alpha}\p {Digit}] |
\p{Alpha} | Alphabetic characters: [\p{Lower}\p{Upper}] |
\p{ASCII} | All ASCII: [\x00-\x7F] |
\p{Blank} | space or Tab character: [ \t] |
\p{Cntrl} | Control character: [\x00-\x1F\x7F] |
\p{Digit} | Decimal digits: [0-9] |
Visible characters: [\p{Alnum}\p{Punct}] | |
Lowercase alphabetic characters: [a-z] | |
Printable characters: [\p{Graph}\x20] | |
Punctuation: !"#$%&'()*+,-./:;<=>?@[\]^_`{|}~ | |
White space characters: [ \t\n\x0B\f\r] | |
uppercase Alphabetical characters: [A-Z] | |
Hexadecimal digits: [0-9a-fA-F] |
## I believe you have mastered the method after reading the case in this article. For more exciting information, please pay attention to other related articles on the php Chinese website!
Recommended reading:
Position matching tutorial of regular expression tutorial (with code) JS password strength verification regular expression (with code) Code)The above is the detailed content of Detailed explanation of the use of regular metacharacters. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics











CrystalDiskMark is a small HDD benchmark tool for hard drives that quickly measures sequential and random read/write speeds. Next, let the editor introduce CrystalDiskMark to you and how to use crystaldiskmark~ 1. Introduction to CrystalDiskMark CrystalDiskMark is a widely used disk performance testing tool used to evaluate the read and write speed and performance of mechanical hard drives and solid-state drives (SSD). Random I/O performance. It is a free Windows application and provides a user-friendly interface and various test modes to evaluate different aspects of hard drive performance and is widely used in hardware reviews

foobar2000 is a software that can listen to music resources at any time. It brings you all kinds of music with lossless sound quality. The enhanced version of the music player allows you to get a more comprehensive and comfortable music experience. Its design concept is to play the advanced audio on the computer The device is transplanted to mobile phones to provide a more convenient and efficient music playback experience. The interface design is simple, clear and easy to use. It adopts a minimalist design style without too many decorations and cumbersome operations to get started quickly. It also supports a variety of skins and Theme, personalize settings according to your own preferences, and create an exclusive music player that supports the playback of multiple audio formats. It also supports the audio gain function to adjust the volume according to your own hearing conditions to avoid hearing damage caused by excessive volume. Next, let me help you

MetaMask (also called Little Fox Wallet in Chinese) is a free and well-received encryption wallet software. Currently, BTCC supports binding to the MetaMask wallet. After binding, you can use the MetaMask wallet to quickly log in, store value, buy coins, etc., and you can also get 20 USDT trial bonus for the first time binding. In the BTCCMetaMask wallet tutorial, we will introduce in detail how to register and use MetaMask, and how to bind and use the Little Fox wallet in BTCC. What is MetaMask wallet? With over 30 million users, MetaMask Little Fox Wallet is one of the most popular cryptocurrency wallets today. It is free to use and can be installed on the network as an extension

NetEase Mailbox, as an email address widely used by Chinese netizens, has always won the trust of users with its stable and efficient services. NetEase Mailbox Master is an email software specially created for mobile phone users. It greatly simplifies the process of sending and receiving emails and makes our email processing more convenient. So how to use NetEase Mailbox Master, and what specific functions it has. Below, the editor of this site will give you a detailed introduction, hoping to help you! First, you can search and download the NetEase Mailbox Master app in the mobile app store. Search for "NetEase Mailbox Master" in App Store or Baidu Mobile Assistant, and then follow the prompts to install it. After the download and installation is completed, we open the NetEase email account and log in. The login interface is as shown below

Cloud storage has become an indispensable part of our daily life and work nowadays. As one of the leading cloud storage services in China, Baidu Netdisk has won the favor of a large number of users with its powerful storage functions, efficient transmission speed and convenient operation experience. And whether you want to back up important files, share information, watch videos online, or listen to music, Baidu Cloud Disk can meet your needs. However, many users may not understand the specific use method of Baidu Netdisk app, so this tutorial will introduce in detail how to use Baidu Netdisk app. Users who are still confused can follow this article to learn more. ! How to use Baidu Cloud Network Disk: 1. Installation First, when downloading and installing Baidu Cloud software, please select the custom installation option.

Windows operating system is one of the most popular operating systems in the world, and its new version Win11 has attracted much attention. In the Win11 system, obtaining administrator rights is an important operation. Administrator rights allow users to perform more operations and settings on the system. This article will introduce in detail how to obtain administrator permissions in Win11 system and how to effectively manage permissions. In the Win11 system, administrator rights are divided into two types: local administrator and domain administrator. A local administrator has full administrative rights to the local computer

Apple rolled out the iOS 17.4 update on Tuesday, bringing a slew of new features and fixes to iPhones. The update includes new emojis, and EU users will also be able to download them from other app stores. In addition, the update also strengthens the control of iPhone security and introduces more "Stolen Device Protection" setting options to provide users with more choices and protection. "iOS17.3 introduces the "Stolen Device Protection" function for the first time, adding extra security to users' sensitive information. When the user is away from home and other familiar places, this function requires the user to enter biometric information for the first time, and after one hour You must enter information again to access and change certain data, such as changing your Apple ID password or turning off stolen device protection.

Detailed explanation of division operation in OracleSQL In OracleSQL, division operation is a common and important mathematical operation, used to calculate the result of dividing two numbers. Division is often used in database queries, so understanding the division operation and its usage in OracleSQL is one of the essential skills for database developers. This article will discuss the relevant knowledge of division operations in OracleSQL in detail and provide specific code examples for readers' reference. 1. Division operation in OracleSQL
