Systematically learn JS regular expressions-JS Tutorial-php.cn

Table of Contents

一、正则表达式简介

1、什么是正则表达式

2、可视化正则表达式工具

1、字面量

标志

三、元字符

1、原义文本字符

2、元字符

四、字符类

例子

字符类取反

五、范围类

疑问

六、预定义类

七、单词边界

八、量词

九、贪婪模式

十、非贪婪模式

十一、分组

1、或

2、反向引用

3、忽略分组

十二、前瞻

十三、RegExp对象属性

十四、RegExp对象方法

1、RegExp.prototype.test()

2、RegExp.prototype.exec()

非全局情况

全局情况

1、String.prototype.search()

2、String.prototype.match()

非全局调用的情况

全局调用的情况

3、String.prototype.split()

4、String.prototype.replace()

常规用法

精细化用法

Home

Web Front-end

JS Tutorial

Systematically learn JS regular expressions

php中世界最好的语言

May 24, 2018 pm 03:12 PM

javascript expression

这次给大家带来系统性学习JS正则表达式，学习JS正则表达式的注意事项有哪些，下面就是实战案例，一起来看一下。

一、正则表达式简介

1、什么是正则表达式

正则表达式，又称规则表达式。（英语：Regular Expression，在代码中常简写为regex、regexp或RE），计算机科学的一个概念。正则表达式通常被用来检索、替换那些符合某个模式(规则)的文本。

简单的说，就是按照某种规则去匹配符合条件的字符串。

2、可视化正则表达式工具

Regexper：https://regexper.com/

二、RegExp对象

实例化RegExp的两种方式。

两种方式定义RegExp对象。

1、字面量

let reg = /[a-z]{3}/gmi;
let reg = /[a-z]{3}/g;
let reg = /[a-z]{3}/m;
let reg = /[a-z]{3}/i;

Copy after login

标志

g global 代表全局搜索。如果不添加，搜索到第一个匹配停止。
m Multi-Line 代表多行搜索。
i ignore case 代表大小写不敏感，默认大小写敏感。

2、构造函数

let reg = new RegExp('\\bis\\b', 'g');

Copy after login

因为JavaScript字符串中\属于特殊字符，需要转义。

三、元字符

把元字符当作转义字符。

正则表达式有两种基本字符类型组成。

原义文本字符
元字符

1、原义文本字符

表示原本意义上是什么字符，就是什么字符。

2、元字符

是在正则表达式中有特殊含义的非字母字符。
* + ? $ ^ . | \ ( ) { } [ ]

字符	含义
`\t`	水平制表符
`\v`	垂直制表符
`\n`	换行符
`\r`	回车符
`\0`	空字符
`\f`	换页符
`\cX`	控制字符，与X对应的控制字符(Ctrl + X)

类似于转义字符。

四、字符类

表示符合某种特性的字符类别。

使用元字符[]可以构建一个简单的类。
所谓类是指符合某些特性的对象，一个泛指，而不是某个字符。

例子

表达式[abc]把字符a或b或c归为一类，表达式可以匹配这一类中的任意一个字符。

// replace() 方法用于在字符串中用一些字符替换另一些字符，或替换一个与正则表达式匹配的子串。
'a1b2c3d4e5'.replace(/[abc]/g, '0');  //010203d4e5

Copy after login

字符类取反

我们想要替换不是abc中任意一个字符的字符。

// 元字符 ^ 创建一个 反向类/负向类
'abcdefg'.replace(/[^abc]/g, '0');  //abc0000

Copy after login

五、范围类

匹配这一个范围内的字符。

如果我们想要匹配数字0-9，那么我们可能会这样写[0123456789]。
如果我们想要匹配26个字母，那么我们可能会这样写[abcdefghijklmnopqrstuvwxyz]。
这样略显麻烦，所以才会有范围类。

例子

// 替换所有数字
'a1c2d3e4f5'.replace(/[0-9]/g, 'x');  //axcxdxexfx
// 替换所有小写字母
'a1c2d3e4f5'.replace(/[a-z]/g, 'x');  //x1x2x3x4x5
// []组成的类内部是可以连写的。替换所有大小写字母
'a1C2d3E4f5G6'.replace(/[a-zA-Z]/g, '*');  //*1*2*3*4*5*6

Copy after login

疑问

如果我想替换数字，并且连带-符号也一起替换呢？

// 替换所有数字和横杠
'2018-5-21'.replace(/[0-9-]/g, '*');  //*********

Copy after login

六、预定义类

一些已经定义的类，可以直接使用。

字符	等价类	含义
`.`	`[^\r\n]`	除了回车、换行之外的所有字符
`\d`	`[0-9]`	数字字符
`\D`	`[^0-9]`	非数字字符
`\s`	`[\t\n\x0B\r]`	空白符
`\S`	`[^\t\n\x0B\r]`	非空白符
`\w`	`[a-zA-Z_0-9]`	单词字符(字母、数字、下划线)
`\W`	`[^a-zA-Z_0-9]`	非单词字符

例子

替换一个 ab + 数字 + 任意字符 的字符串

// 写法1
'ab0c'.replace(/ab[0-9][^\r\n]/g, 'TangJinJian');  //TangJianJian
// 写法2
'ab0c'.replace(/ab\d./g, 'TangJinJian');  //TangJianJian

Copy after login

七、单词边界

字符	含义
`^`	以xxx开始（不在中括号内时的含义）
`$`	以xxx结束
`\b`	单词边界
`\B`	非单词边界

例子

我想替换的字符串，属于那种只在开头出现的。

'YuYan is a boy, YuYan'.replace(/^YuYan/g, 'TangJinJian');  //TangJinJian is a boy, YuYan

Copy after login

我想替换的字符串，属于那种只在结尾出现的。

'YuYan is a boy, YuYan'.replace(/YuYan$/g, 'TangJinJian');  //YuYan is a boy, TangJinJian

Copy after login

单词边界例子。

// 替换所有is为0
'This is a man'.replace(/is/g, '0');  //Th0 0 a man
// 替换所有is前面带有单词边界的字符串
'This is a man'.replace(/\bis/g, '0');  //This 0 a man
// 替换所有is前面没有单词边界的字符串
'This is a man'.replace(/\Bis\b/g, '0');  //Th0 is a man

Copy after login

八、量词

用来处理连续出现的字符串。

字符	含义
`?`	出现零次或一次（最多出现一次）
`+`	出现一次或多次（至少出现一次）
`*`	出现零次或多次（任意次）
`{n}`	出现n次
`{n,m}`	出现n到m次
`{n,}`	至少出现n次

我想替换字符串中连续出现10次的数字为*。

'1234567890abcd'.replace(/\d{10}/, '*');  //*abcd

Copy after login

我想替换字符串中的QQ号码。

'我的QQ是：10000'.replace(/[1-9][0-9]{4,}/, '19216811');  //我的QQ是：19216811

Copy after login

九、贪婪模式

尽可能多的匹配。

有这样的一种场景下的正则表达式，/\d{3,6}/该替换3个数字还是6个数字呢，4、5个数字？

// 贪婪模式会尽可能的往多的方面去匹配
'123456789'.replace(/\d{3,6}/, 'x');  //x789
'123456789'.replace(/\d+/, 'x');  //x
'123456789'.replace(/\d{3,}/, 'x');  //x

Copy after login

十、非贪婪模式

尽可能少的匹配。

如果我们想要最低限度的替换呢？

// 非贪婪模式使用 ? 尽可能的往少的方面去匹配
'12345678'.replace(/\d{3,6}?/g, 'x');  //xx78
'123456789'.replace(/\d{3,6}?/g, 'x');  //xxx

Copy after login

因为有g标志，会匹配这段字符串里所有符合规则的字符串。
第一个规则/\d{3,6}?/g，12345678中有两个符合条件的字符串，是123和456。所以替换结果是xx78。
第二个规则/\d{3,6}?/g，123456789中有三个符合条件的字符串，是123、456和789。所以替换结果是xxx。

十一、分组

括号里的一些规则，分为一组。

我想替换连续出现3次的字母和数字。

//没有分组的情况下，后面的量词，只是表示匹配3次数字。
'a1b2d3c4'.replace(/[a-z]\d{3}/g, '*');  //a1b2d3c4
//有分组的情况下，分组后面的量词，表示符合这个分组里规则的字符串，匹配3次。
'a1b2d3c4'.replace(/([a-z]\d){3}/g, '*');  //*c4

Copy after login

1、或

分组里有两种规则，只要满足其中一种即可匹配。

//我想把ijaxxy和ijcdxy都替换成*
'ijabxyijcdxy'.replace(/ij(ab|cd)xy/g, '*');  //**

Copy after login

2、反向引用

可以把分组视为变量，来引用。

//我想把改变年月日之间的分隔符
'2018-5-22'.replace(/(\d{4})-(\d{1,2})-(\d{1,2})/g, '$1/$2/$3');  //2018/5/22
//我想替换日期，并且更改顺序
'2018-5-22'.replace(/(\d{4})-(\d{1,2})-(\d{1,2})/g, '$2/$3/$1');  //5/22/2018

Copy after login

3、忽略分组

忽略掉分组，不捕获分组，只需要在分组内加上?:

// 忽略掉匹配年的分组后，匹配月的分组变成了$1，日的分组变成了$2
'2018-5-22'.replace(/(?:\d{4})-(\d{1,2})-(\d{1,2})/g, '$1/$2/$3');  //5/22/$3

Copy after login

十二、前瞻

正则表达式从文本头部向尾部开始解析，文本尾部方向，称为“前”。
前瞻就是在正在表达式匹配到规则的时候，向前检查是否符合断言，后顾/后瞻方向相反。
JavaScript不支持后顾。
符合和不符合特定断言称为肯定/正向匹配和否定/负向匹配。

名称	正则	含义
正向前瞻	`exp(?=assert)`
负向前瞻	`exp(?!assert)`
正向后顾	`exp(?<=assert)`	JavaScript不支持
负向后顾	`exp(?<!assert)`	JavaScript不支持

例子

有这样一个单词字符+数字格式的字符串，只要满足这种格式，就把其中的单词字符替换掉。

'a1b2ccdde3'.replace(/\w(?=\d)/g, '*');  //*1*2ccdd*3

Copy after login

有这样一个单词字符+非数字格式的字符串，只要满足这种格式，就把前面的单词字符替换掉。

'a1b2ccdde3'.replace(/\w(?!\d)/g, '*');  //a*b*****e*

Copy after login

十三、RegExp对象属性

global是否全文搜索，默认false。
ignore case是否大小写敏感，默认是false。
multiline多行搜索，默认值是false。
lastIndex是当前表达式匹配内容的最后一个字符的下一个位置。
source正则表达式的文本字符串。

let reg1 = /\w/;
let reg2 = /\w/gim;
reg1.global;  //false
reg1.ignoreCase;  //false
reg1.multiline;  //false
reg2.global;  //true
reg2.ignoreCase;  //true
reg2.multiline;  //true

Copy after login

十四、RegExp对象方法

1、RegExp.prototype.test()

用来查看正则表达式与指定的字符串是否匹配。返回true或false。

let reg1 = /\w/;
reg1.test('a');  //true
reg1.test('*');  //false

Copy after login

加上g标志之后，会有些区别。

let reg1 = /\w/g;
// 第一遍
reg1.test('ab');  //true
// 第二遍
reg1.test('ab');  //true
// 第三遍
reg1.test('ab');  //false
// 第四遍
reg1.test('ab');  //true
// 第五遍
reg1.test('ab');  //true
// 第六遍
reg1.test('ab');  //false

Copy after login

实际上这是因为RegExp.lastIndex。每次匹配到之后，lasgIndex会改变。
lastIndex是正则表达式的一个可读可写的整型属性，用来指定下一次匹配的起始索引。

let reg = /\w/g;
// 每次匹配到，就会把lastIndex指向匹配到的字符串后一个字符的索引。
while(reg.test('ab')) {
    console.log(reg.lastIndex);
}
// 1
// 2

Copy after login

reg.lastIndex初始时为0，第一个次匹配到a的时候，reg.lastIndex为1。第二次匹配到b的时候，reg.lastIndex为2。

let reg = /\w\w/g;
while(reg.test('ab12cd')) {
  console.log(reg.lastIndex);
}
// 2
// 4
// 6

Copy after login

reg.lastIndex初始时为0，第一个次匹配到ab的时候，reg.lastIndex为2。第二次匹配到12的时候，reg.lastIndex为4。第三次匹配到cd的时候，reg.lastIndex为6。

let reg = /\w/g;
// 匹配不到符合正则的字符串之后，lastIndex会变为0。
while(reg.test('ab')) {
    console.log(reg.lastIndex);
}
console.log(reg.lastIndex);
reg.test('ab');
console.log(reg.lastIndex);
// 1
// 2
// 0
// 1

Copy after login

所以，这就是为什么reg.test('ab')再多次执行之后，返回值为false的原因了。

let reg = /\w/g;
reg.lastIndex = 2;
reg.test('ab');  //false

Copy after login

每次匹配的起始位置，是以lastIndex为起始位置的。上述例子，一开始从位置2开始匹配，位置2后面没有符合正则的字符串，所以为false。

2、RegExp.prototype.exec()

在一个指定字符串中执行一个搜索匹配。返回一个搜索的结果数组或null。

非全局情况

let reg = /\d(\w)\d/;
let ts = '*1a2b3c';
let ret = reg.exec(ts);  //ret是结果数组
// reg.lastIndex肯定是0，因为没有g标志。 没有g标志的情况下，lastIndex被忽略。
console.log(reg.lastIndex + '\t' + ret.index + '\t' + ret.toString());
console.log(ret);
// 0  1 1a2,a
// ["1a2", "a"]

Copy after login

返回数组是有以下元素组成的：

第一个元素是与正则表达式相匹配的文本。
第二个元素是reg对象的第一个子表达式相匹配的文本（如果有的话）。
第二个元素是reg对象的第二个子表达式相匹配的文本（如果有的话），以此类推。

// 子表达式就是分组。
let reg = /\d(\w)(\w)(\w)\d/;
let ts = '*1a2b3c';
let ret = reg.exec(ts);
console.log(reg.lastIndex + '\t' + ret.index + '\t' + ret.toString());
console.log(ret);  //输出结果数组
// 0  1 1a2b3,a,2,b
// ["1a2b3", "a", "2", "b"]

Copy after login

全局情况

let reg = /\d(\w)(\w)(\w)\d/g;
let ts = '*1abc25def3g';
while(ret = reg.exec(ts)) {
    console.log(reg.lastIndex + '\t' + ret.index + '\t' + ret.toString());
}
// 6  1 1abc2,a,b,c
// 11 6 5def3,d,e,f

Copy after login

第一次匹配的是1abc2，1abc2的后一个字符的起始位置是6，所以reg.lastIndex是6。
1abc2的第一个字符的起始位置是1，所以ret.index是1。

第二次匹配的是5def3，5def3的后一个字符的起始位置是11，所以reg.lastIndex是11。
5def3的第一个字符的起始位置是6，所以ret.index是6。

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

How Long Does It Take To Beat Split Fiction?

1 months ago By DDD

R.E.P.O. Best Graphic Settings

2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

1 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Zend Studio 13.0.1

Powerful PHP integrated development environment

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7409

Java Tutorial

1631

CakePHP Tutorial

1358

Laravel Tutorial

1268

PHP Tutorial

1218

Related knowledge

How to implement an online speech recognition system using WebSocket and JavaScript Dec 17, 2023 pm 02:54 PM

How to use WebSocket and JavaScript to implement an online speech recognition system Introduction: With the continuous development of technology, speech recognition technology has become an important part of the field of artificial intelligence. The online speech recognition system based on WebSocket and JavaScript has the characteristics of low latency, real-time and cross-platform, and has become a widely used solution. This article will introduce how to use WebSocket and JavaScript to implement an online speech recognition system.

WebSocket and JavaScript: key technologies for implementing real-time monitoring systems Dec 17, 2023 pm 05:30 PM

WebSocket and JavaScript: Key technologies for realizing real-time monitoring systems Introduction: With the rapid development of Internet technology, real-time monitoring systems have been widely used in various fields. One of the key technologies to achieve real-time monitoring is the combination of WebSocket and JavaScript. This article will introduce the application of WebSocket and JavaScript in real-time monitoring systems, give code examples, and explain their implementation principles in detail. 1. WebSocket technology

How to implement an online reservation system using WebSocket and JavaScript Dec 17, 2023 am 09:39 AM

How to use WebSocket and JavaScript to implement an online reservation system. In today's digital era, more and more businesses and services need to provide online reservation functions. It is crucial to implement an efficient and real-time online reservation system. This article will introduce how to use WebSocket and JavaScript to implement an online reservation system, and provide specific code examples. 1. What is WebSocket? WebSocket is a full-duplex method on a single TCP connection.

How to use JavaScript and WebSocket to implement a real-time online ordering system Dec 17, 2023 pm 12:09 PM

Introduction to how to use JavaScript and WebSocket to implement a real-time online ordering system: With the popularity of the Internet and the advancement of technology, more and more restaurants have begun to provide online ordering services. In order to implement a real-time online ordering system, we can use JavaScript and WebSocket technology. WebSocket is a full-duplex communication protocol based on the TCP protocol, which can realize real-time two-way communication between the client and the server. In the real-time online ordering system, when the user selects dishes and places an order

JavaScript and WebSocket: Building an efficient real-time weather forecasting system Dec 17, 2023 pm 05:13 PM

JavaScript and WebSocket: Building an efficient real-time weather forecast system Introduction: Today, the accuracy of weather forecasts is of great significance to daily life and decision-making. As technology develops, we can provide more accurate and reliable weather forecasts by obtaining weather data in real time. In this article, we will learn how to use JavaScript and WebSocket technology to build an efficient real-time weather forecast system. This article will demonstrate the implementation process through specific code examples. We

Simple JavaScript Tutorial: How to Get HTTP Status Code Jan 05, 2024 pm 06:08 PM

JavaScript tutorial: How to get HTTP status code, specific code examples are required. Preface: In web development, data interaction with the server is often involved. When communicating with the server, we often need to obtain the returned HTTP status code to determine whether the operation is successful, and perform corresponding processing based on different status codes. This article will teach you how to use JavaScript to obtain HTTP status codes and provide some practical code examples. Using XMLHttpRequest

How to use insertBefore in javascript Nov 24, 2023 am 11:56 AM

Usage: In JavaScript, the insertBefore() method is used to insert a new node in the DOM tree. This method requires two parameters: the new node to be inserted and the reference node (that is, the node where the new node will be inserted).

How to get HTTP status code in JavaScript the easy way Jan 05, 2024 pm 01:37 PM

Introduction to the method of obtaining HTTP status code in JavaScript: In front-end development, we often need to deal with the interaction with the back-end interface, and HTTP status code is a very important part of it. Understanding and obtaining HTTP status codes helps us better handle the data returned by the interface. This article will introduce how to use JavaScript to obtain HTTP status codes and provide specific code examples. 1. What is HTTP status code? HTTP status code means that when the browser initiates a request to the server, the service

See all articles

Systematically learn JS regular expressions

一、正则表达式简介

1、什么是正则表达式

2、可视化正则表达式工具

二、RegExp对象

1、字面量

标志

2、构造函数

三、元字符

1、原义文本字符

2、元字符

四、字符类

例子

字符类取反

五、范围类

例子

疑问

六、预定义类

例子

七、单词边界

例子

八、量词

九、贪婪模式

十、非贪婪模式

十一、分组

1、或

2、反向引用

3、忽略分组

十二、前瞻

例子

十三、RegExp对象属性

十四、RegExp对象方法

1、RegExp.prototype.test()

2、RegExp.prototype.exec()

非全局情况

全局情况

十五、字符串对象方法

1、String.prototype.search()

2、String.prototype.match()

非全局调用的情况

全局调用的情况

3、String.prototype.split()

4、String.prototype.replace()

常规用法

精细化用法

Hot AI Tools

Undresser.AI Undress

AI Clothes Remover

Undress AI Tool

Clothoff.io

AI Hentai Generator

Hot Article

Hot Tools

Notepad++7.3.1

SublimeText3 Chinese version

Zend Studio 13.0.1

Dreamweaver CS6

SublimeText3 Mac version

Hot Topics