Linux Bash：轻松删除 HTML 表数据块

Question

我有一个使用bash脚本处理的html文件，并且想要删除空表。该文件是从sql语句生成的，但在未找到记录时包含表头。我想删除没有找到记录的标题。Tablewithdatatype<

P粉242741921 · Answer

像这样，使用 xmlstarlet 和 xpath：

$ xmlstarlet format -H file.html | sponge file.html
$ xmlstarlet ed -d '//table[./caption/text()="Empty Table To Remove"]' file.html

Data rows exists here

Table with data
type	column1	column2	column3	column4

Data rows exists here

Table with data
type	column1	column2	column3	column4

要在 sed -i 等位置进行编辑，请使用

xmlstarlet edit -L ...