Since punctuation marks are removed, of course, regular expressions are used.
Regular expression, also known as regular expression. (English: Regular Expression, often abbreviated as regex, regexp or RE in code), a concept in computer science. Regular expressions are often used to retrieve and replace text that matches a certain pattern (rule). (Recommended learning: Python video tutorial)
Regular expression is a logical formula for string operations, which uses some specific characters defined in advance, and these specific A combination of characters forms a "rule string". This "rule string" is used to express a filtering logic for strings.
Given a regular expression and another string, we can achieve the following goals:
1. Whether the given string conforms to the regular expression The filtering logic of expressions (called "matching"):
2. We can get the specific part we want from a string through regular expressions.
You can filter the punctuation marks in the string to remove or replace the punctuation marks.
import re punctuation = '!,;:?"\'' def removePunctuation(text): text = re.sub(r'[{}]+'.format(punctuation),'',text) return text.strip().lower() text = " Hello, world! " print removePunctuation(text)
The characteristics of regular expressions are:
1. Very flexible, logical and functional;
2. Can be quickly Achieve complex control of strings in an extremely simple way.
3. For those who are new to it, it is relatively obscure.
For more Python related technical articles, please visit the Python Tutorial column to learn!
The above is the detailed content of How to remove punctuation marks in python. For more information, please follow other related articles on the PHP Chinese website!