我想要以下功能.
input : this is test <b> bold text </b> normal text expected output: this is test normal text
即删除指定标签的内容
解决方法
使用BeautifulSoup的解决方案:
from BeautifulSoup import BeautifulSoup
def removeTag(soup,tagname):
for tag in soup.findAll(tagname):
contents = tag.contents
parent = tag.parent
tag.extract()
s = BeautifulSoup("abcd <b> btag </b> hello <d>dtag</d>")
removeTag(s,"b")
print s
removeTag(s,"d")
print s
收益:
>>> abcd hello <d>dtag</d> abcd hello