如何在Python中的两个xml标记之间获取全部内容?

更新时间：2023-11-24 12:52:22

from lxml import etree
t = etree.XML(
"""<?xml version="1.0" encoding="UTF-8"?>
<review>
  <title>Some testing stuff</title>
  <text>Some text with <extradata>data</extradata> in it.</text>
</review>"""
)
(t.text + ''.join(map(etree.tostring, t))).strip()

这里的诀窍是t是可迭代的，并且在迭代时会产生所有子节点.由于etree避免了文本节点，因此还需要使用t.text恢复第一个子标记之前的文本.

The trick here is that t is iterable, and when iterated, yields all child nodes. Because etree avoids text nodes, you also need to recover the text before the first child tag, with t.text.

In [50]: (t.text + ''.join(map(etree.tostring, t))).strip()
Out[50]: '<title>Some testing stuff</title>\n  <text>Some text with <extradata>data</extradata> in it.</text>'

或者:

In [6]: e = t.xpath('//text')[0]

In [7]: (e.text + ''.join(map(etree.tostring, e))).strip()
Out[7]: 'Some text with <extradata>data</extradata> in it.'

上一篇 : ：安装SQL Server 2014 express local-db时出错下一篇 : MySQL表结构，我需要主键吗?

如何在Python中的两个xml标记之间获取全部内容?

相关阅读

推荐文章