Find all page ids for a URL in XML

1 answer

User

#1 · Posted 2024-04-20 11:39:30

I created an XML file by copying the content you provided in the question and filling in some ids.

<pages>
    <page>
     <id>1</id>
     <name></name>
     <description>&lt;a href=&quot;http://google.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
     <boxes>
      <box>
      </box>
     </boxes>
    </page>
    <page>
     <id>2</id>
     <name></name>
     <description>&lt;a href=&quot;http://google.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
     <boxes>
      <box>
      </box>
     </boxes>
    </page>
    <page>
     <id>3</id>
     <name></name>
     <description>&lt;a href=&quot;http://google.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
     <boxes>
      <box>
      </box>
     </boxes>
    </page>
</pages>

This code prints the id and description of each page.

>>> from lxml import etree
>>> tree = etree.parse('temp.xml')

>>> for page in tree.xpath('.//page'):
...     page.xpath('id')[0].text, page.xpath('description')[0].text
... 
('1', '<a href="http://google.com" target="_self">LINK</a>')
('2', '<a href="http://google.com" target="_self">LINK</a>')
('3', '<a href="http://google.com" target="_self">LINK</a>')
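To go from printing every page to listing only the ids of pages that link to a given URL, one option is to pull the href values out of each unescaped description and compare them with the target. The following is a minimal sketch, not the only way to do it. It assumes the standard-library xml.etree.ElementTree in place of lxml so that it runs without extra installs, and the sample data, the regular expression, and the function name page_ids_for_url are all hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample mirroring the structure above; page 2 links elsewhere
# so that the filter has something to exclude.
SAMPLE_XML = """<pages>
    <page>
     <id>1</id>
     <description>&lt;a href=&quot;http://google.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
    </page>
    <page>
     <id>2</id>
     <description>&lt;a href=&quot;http://example.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
    </page>
    <page>
     <id>3</id>
     <description>&lt;a href=&quot;http://google.com&quot; target=&quot;_self&quot;&gt;LINK&lt;/a&gt;</description>
    </page>
</pages>"""

# Assumption: links are written as href="..." with double quotes,
# as in the descriptions above.
HREF_RE = re.compile(r'href="([^"]*)"')


def page_ids_for_url(xml_text, url):
    """Return the ids of all pages whose description links to exactly `url`."""
    root = ET.fromstring(xml_text)
    ids = []
    for page in root.iter('page'):
        # The parser has already unescaped &lt; and &quot;, so the
        # description text is plain HTML markup at this point.
        description = page.findtext('description') or ''
        if url in HREF_RE.findall(description):
            ids.append(page.findtext('id'))
    return ids


print(page_ids_for_url(SAMPLE_XML, 'http://google.com'))  # ['1', '3']
```

With lxml the same loop should work by swapping in tree.xpath('.//page') and reading page.xpath('id')[0].text as in the session above. If the descriptions contain messier HTML, a real HTML parser would be more robust than the regular expression.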
