Regular expressions involving AND in python

Question

I've been struggling for a while now trying to get the correct regex for the following task: I want to remove data from table tags in an html file using python. My approach to doing this is to do the following recursively (store the HTML lines between tags as strings): s="desired content" reassign the string s to a string that removes everything between the "". s=re.sub('{1}','',s) Repeat this operation until you are left with s="desired content". My question is how to implement the bold part in brackets. Thanks. your text me

P粉348088995 · Answer

To negate a character class, place ^ after [. Additionally, you do not need to specify {1} for characters that occur once.

test_str = re.sub('<[^<>]*>', '', test_str)

However, please note that it is more appropriate to use a dedicated HTML parser like BeautifulSoup instead of regular expressions to get data from HTML.