从lxml.objectify.ObjectifiedElement中剥离Python命名空间属性

7 投票
2 回答
7642 浏览
提问于 2025-04-16 18:24

可能重复的问题:
使用lxml时,XML能否在没有命名空间属性的情况下渲染?

我该如何去掉一个 lxml.objectify.ObjectifiedElement 中的Python属性呢?

示例:

In [1]: from lxml import etree, objectify
In [2]: foo = objectify.Element("foo")
In [3]: foo.bar = "hi"
In [4]: foo.baz = 1
In [5]: foo.fritz = None
In [6]: print etree.tostring(foo, pretty_print=True)
<foo xmlns:py="http://codespeak.net/lxml/objectify/pytype" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" py:pytype="TREE">
  <bar py:pytype="str">hi</bar>
  <baz py:pytype="int">1</baz>
  <fritz xsi:nil="true"/>
</foo>

我希望输出的结果看起来像这样:

<foo>
  <bar>hi</bar>
  <baz>1</baz>
  <fritz/>
</foo>

2 个回答

13

你可以通过使用 etree.strip_attributesetree.cleanup_namespaces 来实现这个功能。

In [8]: etree.strip_attributes(foo, '{http://codespeak.net/lxml/objectify/pytype}pytype')
In [9]: print etree.tostring(foo, pretty_print=True)
<foo xmlns:py="http://codespeak.net/lxml/objectify/pytype" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <bar>hi</bar>
  <baz>1</baz>
  <fritz xsi:nil="true"/>
</foo>

In [10]: etree.cleanup_namespaces(foo)
In [11]: print etree.tostring(foo, pretty_print=True)
<foo xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <bar>hi</bar>
  <baz>1</baz>
  <fritz xsi:nil="true"/>
</foo>

不过,这样做还会留下 xsi:nil 的引用,你可以用类似的方法把它去掉。

In [12]: etree.strip_attributes(foo, '{http://www.w3.org/2001/XMLSchema-instance}nil')
In [13]: etree.cleanup_namespaces(foo)
In [14]: print etree.tostring(foo, pretty_print=True)
<foo>
  <bar>hi</bar>
  <baz>1</baz>
  <fritz/>
</foo>
5

还有一个专门的函数叫做 objectify.deannotate(...):

Help on built-in function deannotate in module lxml.objectify:

      deannotate(...)
        deannotate(element_or_tree, pytype=True, xsi=True, xsi_nil=False, cleanup_namespaces=False)

        Recursively de-annotate the elements of an XML tree by removing 'py:pytype'
        and/or 'xsi:type' attributes and/or 'xsi:nil' attributes.

        If the 'pytype' keyword argument is True (the default), 'py:pytype'
        attributes will be removed. If the 'xsi' keyword argument is True (the
        default), 'xsi:type' attributes will be removed.
        If the 'xsi_nil' keyword argument is True (default: False), 'xsi:nil'
        attributes will be removed.

        Note that this does not touch the namespace declarations by
        default.  If you want to remove unused namespace declarations from
        the tree, pass the option ``cleanup_namespaces=True``.

撰写回答