更新时间:2022-03-30 21:31:45
只需对 titleSoup
>>> titleSoup=firstBlockSoup.find('a',attrs={'class':'fk-srch-title-text fksd-bodytext'})
>>> titleSoup.text
u'Wilco Classic Library: Autobiography Of a Yogi (Hardcover)'
这也将起作用:
invalid_tags = ['b']
titleSoup=firstBlockSoup.find('a',attrs={'class':'fk-srch-title-text fksd-bodytext'})
for tag in invalid_tags:
for match in titleSoup.findAll(tag):
match.replaceWithChildren()
print "".join(titleSoup.contents)