Python XML data with BeautifulSoup

<ticket>
  <id>123456789</id>
  <create_date>2017-12-09</create_date>
  <correspondence>
    <diary_entry>
      <user>username</user>
      <timestamp>2017-12-09</timestamp>
      <body>A bunch of text in here a lot of text more text</body>
    </diary_entry>
  </correspondence>
  <work_log>
    <diary_entry>
      <user>someotheruser</user>
      <timestamp>2017-12-09</timestamp>
      <body>Some more text in here and other text</body>
    </diary_entry>
  </work_log>
</ticket>

import requests
from bs4 import BeautifulSoup
from requests.auth import HTTPBasicAuth

ticket_url = "https://somelink/tickets/123456789"
r = requests.get(ticket_url, auth=HTTPBasicAuth('username', 'password'))
soup = BeautifulSoup(r.content, 'xml')
updates = soup.find_all('body')
for update in updates:
    # Flag every <body> that lacks a "next steps:" marker
    if "next steps:" not in update.text.lower():
        print("no")
        print(update.text.lower())
    else:
        print("yes")

1 Answer

User

#1 · Posted 2024-05-28 22:41:30

If the ticket contains only one "correspondence" element with a single "body", you can chain the find calls:

update = soup.find('correspondence').find('body')
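As a rough illustration of why the chained call matters, here is a minimal sketch run against a trimmed, reordered copy of the ticket. The sample data, the `make_soup` helper, and the fallback to `html.parser` (used only when lxml is not installed) are assumptions added for this example, not part of the original question.

```python
from bs4 import BeautifulSoup, FeatureNotFound

# Hypothetical sample: <work_log> is placed first on purpose, so that a plain
# soup.find("body") would return the wrong element.
TICKET_XML = """<ticket>
  <work_log>
    <diary_entry>
      <user>someotheruser</user>
      <body>Some more text in here</body>
    </diary_entry>
  </work_log>
  <correspondence>
    <diary_entry>
      <user>username</user>
      <body>A bunch of text in here</body>
    </diary_entry>
  </correspondence>
</ticket>"""


def make_soup(markup):
    """Parse with the "xml" feature (needs lxml); fall back to html.parser."""
    try:
        return BeautifulSoup(markup, "xml")
    except FeatureNotFound:
        return BeautifulSoup(markup, "html.parser")


soup = make_soup(TICKET_XML)

# Unscoped search: first <body> anywhere in the document (the work_log one here).
first_body_text = soup.find("body").get_text(strip=True)

# Scoped search: find() returns None when nothing matches, so guard the chain.
correspondence = soup.find("correspondence")
update = correspondence.find("body") if correspondence is not None else None
update_text = update.get_text(strip=True) if update is not None else None

print(first_body_text)
print(update_text)
```

The `None` guard is optional when every ticket is known to contain a "correspondence" element; without it, a missing element raises an `AttributeError` on the second `find`.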

Otherwise, you have to iterate using find_all (Beautiful Soup 4) or findAll (Beautiful Soup 3):

correspondences = soup.find_all("correspondence")
for correspondence in correspondences:
    updates = correspondence.find_all("body")
    for update in updates:
        ...  # handle each update here
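One way the loop body might be filled in, combining the nested iteration with the "next steps:" check from the question. This is a sketch under assumptions: the sample ticket, the `make_soup` helper (which falls back to `html.parser` when lxml is missing), and the `correspondence_updates` function are all hypothetical names introduced here.

```python
from bs4 import BeautifulSoup, FeatureNotFound

# Hypothetical ticket with two correspondence entries and one work_log entry.
TICKET_XML = """<ticket>
  <correspondence>
    <diary_entry>
      <body>Next steps: call the customer</body>
    </diary_entry>
    <diary_entry>
      <body>Just a status note</body>
    </diary_entry>
  </correspondence>
  <work_log>
    <diary_entry>
      <body>Next steps: internal only</body>
    </diary_entry>
  </work_log>
</ticket>"""


def make_soup(markup):
    """Parse with the "xml" feature (needs lxml); fall back to html.parser."""
    try:
        return BeautifulSoup(markup, "xml")
    except FeatureNotFound:
        return BeautifulSoup(markup, "html.parser")


def correspondence_updates(markup):
    """Return (text, has_next_steps) for each <body> inside <correspondence>."""
    soup = make_soup(markup)
    results = []
    for correspondence in soup.find_all("correspondence"):
        for update in correspondence.find_all("body"):
            text = update.get_text(strip=True)
            results.append((text, "next steps:" in text.lower()))
    return results


results = correspondence_updates(TICKET_XML)
for text, has_next_steps in results:
    print("yes" if has_next_steps else "no", "-", text)
```

Because the inner `find_all` is called on each "correspondence" element rather than on the whole soup, the "body" under "work_log" is never visited.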
