创建一个css选择器以在一个单独的窗口中定位多个id

from lxml.html import fromstring html = """ <div class="rest-list-information"> <a class="restaurant-header" href="/madison-wi/restaurants/pizza-hut"> Pizza Hut </a> <div id="featured other-dynamic-ids"> <span>Sponsored</span> </div> </div> <div class="rest-list-information"> <a class="restaurant-header" href="/madison-wi/restaurants/salads-up"> Salads UP </a> <div id="other-dynamic-ids border"> <span>Featured</span> </div> </div> """ root = fromstring(html) for item in root.cssselect("[id~='featured'] span,[id~='border'] span"): print(item.text)

2条回答

网友

1楼 · 编辑于 2024-05-01 22:07:11

如果您只是想从HTML中获取所有“span”文本，那么以下内容就足够了：

root_spans = root.xpath('//span')

for i, root_spans in enumerate(root_spans):
    span_text = root_spans.xpath('.//text()')[0]
    print(span_text)

网友

2楼 · 编辑于 2024-05-01 22:07:11

你可以做：

.rest-list-information div span

但我认为把逗号弄乱是个坏主意。你不会找到很多没有逗号的样式表。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章