lxml：处理tai时元素addnext（）和insert（）之间的区别

>>> import lxml.etree >>> s = "This is bold and this is italic text." # Create a new lxml element. >>> xml = lxml.etree.fromstring(s) # Let's look at the element, its child, and all the texts and tails. >>> lxml.etree.tostring(xml) b'This is bold and this is italic text.' >>> xml.text 'This is ' >>> xml.tail >>> xml[0].text 'bold' >>> xml[0].tail ' and this is italic text.'

# Adds the element as a following sibling directly after this element. # Note that tail text is automatically discarded when adding at the root level. >>> xml[0].addnext(new_c) >>> lxml.etree.tostring(xml) b'This is bolditalic text. and this is '

2条回答

网友
1楼 · 编辑于 2024-05-16 21:50:24

tail只存在于lxml的级别上；在libxml2中，它是一个文本节点，就像它在DOM中一样。主要原因是解析格式良好的XML（http://lxml.de/tutorial.html#elements-contain-text）时的便利性：
The two properties .text and .tail are enough to represent any text content in an XML document. This way, the ElementTree API does not require any special text nodes in addition to the Element class, that tend to get in the way fairly often (as you might know from classic DOM APIs).
{afs>努力从源代码维护所有抽象的函数。E、 g.index()只统计元素/注释/entityrefs/PI节点，而树操作例程似乎总是会移动节点的尾部。但是，由于这个概念
是如此的缺乏记录
是为用户不关心尾随文本的XML而定制的
与常规代表权冲突
它的应用似乎有不一致之处。这看起来像是一个错误（如果一致性是一个目标的话也是一个bug）。我将与维护人员讨论最后一条语句，以澄清库关于tails的预期行为。在

网友
2楼 · 编辑于 2024-05-16 21:50:24

elem.addnext(nextelem)在XML级别进行操作，即直接在元素之后添加内容，将任何尾部文本移到新插入的元素后面。这样做是为了使新元素成为一个直接跟在后面的同级元素。在
parent.insert(where,elem)的工作方式与父元素只是etree.Element的列表一样。它将新元素放入列表中，而不会对etree.元素实例。parent.append(elem)也可以这样工作，或者任何其他的列表操作。在
因此，这些函数在元素树上有两个不同的视图。在
>>> from lxml import etree as et >>> >>> x = et.XML('<a>foobar</a>') >>> y = et.XML('<c>C!</c>') >>> >>> et.dump(x) <a>foobar</a> >>> x.find('b').addnext(y) >>> et.dump(x) <a>foo<c>C!</c>bar</a>
尾部从b元素移动到c元素，以保持除了插入元素之外的XML文档不变。在
现在，如果插入的元素已经有尾部，addnext用于插入元素及其后面的文本。直接在XML元素之后，而不是在带有tail的etree元素之后。在
^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章