Regular expression matching html to filter illegal characters
To match an html tag, the matching table is as follows:
以上两个表达式,一个加了"?"和一个却不加"?",那么这有什么区别呢?
我们知道"?"在正则表达式里是一个通配符:匹配前面的子表达式零次或一次,或指明一个非贪婪限定符。
在这里,通过测试,我们得出这样的结论:在不加"?"的情况下,在匹配下面一段内容的时候:
这是第一个table
This is the first table
I am not the content in the table
This is the second table
This is the third table
$str=preg_replace("/s+/", " ", $str); //Filter redundant carriage returns
$str=preg_replace("/<[ ]+/si","<",$str); //Filter <__("<" with a space after it)
/si","",$str); //Comments
$str=preg_replace("/<(!.*?)>/si","",$str); //Filter doctype
$str=preg_replace("/<(/?head.*?)>/si","",$str); //Filter head tag
$str=preg_replace("/<(/?meta.*?)>/si","",$str); //Filter meta tags
$str=preg_replace("/<(/?link.*?)>/si","",$str); //Filter link tag
$str=preg_replace("/<(/?form.*?)>/si","",$str); //Filter form tag
$str=preg_replace("/<(applet.*?)>(.*?)<(/applet.*?)>/si","",$str); //Filter applet tag
$str=preg_replace("/<(style.*?)>(.*?)<(/style.*?)>/si","",$str); //Filter style tag
$str=preg_replace("/<(title.*?)>(.*?)<(/title.*?)>/si","",$str); //Filter title tag
$str=preg_replace("/<(object.*?)>(.*?)<(/object.*?)>/si","",$str); //Filter object tag
$str=preg_replace("/<(/?objec.*?)>/si","",$str); //Filter object tag
$str=preg_replace("/<(noframes.*?)>(.*?)<(/noframes.*?)>/si","",$str); //Filter noframes tag
$str=preg_replace("/<(/?noframes.*?)>/si","",$str); //Filter noframes tag
$str=preg_replace("/<(i?frame.*?)>(.*?)<(/i?frame.*?)>/si","",$str) ; //Filter frame tag
$str=preg_replace("/<(/?i?frame.*?)>/si","",$str); //Filter frame tag$str=preg_replace("/<(/?script.*?)>/si","",$str); //Filter script tags
www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/445418.htmlTechArticleRegular expression matching html filters illegal characters to match an html tag. The matching table is as follows: [ss]* or [ss ]*? The above two expressions, one with ? and one without ?, so what does this mean...