如何提取文本在一个扩展更多按钮使用scrapy？

2条回答

网友

1楼 · 编辑于 2024-06-01 04:31:46

您可能会在结尾看到“单击以展开”文本，但仍然会得到整个引用。您需要的是避免提取“单击以展开”文本。在

例如：

>>> response.xpath('//li[contains(@class, "message")][.//a/text()[.="#52365"]]//*[re:test(@class, "\\bquote\\b")]//text()').getall()
['CCS for model 3 coming', '\nWhile article references Europe, the North American theater will be getting a CCS adapter soon.', '\nSee article for', '\n', '\n', 'Tesla launches $190 CCS adapter for new Model S and Model X, offers retrofits for older vehicles', '\n', '\nMartian High Command', '\n', '\nPS: Text from article.', '\n', '\nUpdate: A Tesla spokesperson told us that they will make sure owners in North America will have access to all “compelling networks”, but they have nothing to announce now.']

网友

2楼 · 编辑于 2024-06-01 04:31:46

正如有人在评论中指出的，你不需要点击任何东西。如果在浏览器中打开“文档检查器”，则可以看到所有文本都在其中。在

您可以使用简单的css选择器和for循环检索所有邮件：

for post in sel.css('.messageList>li'): 
    text = ''.join(post.css('blockquote.messageText ::text').extract()) 
    print(text) 
    print('   ')

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何提取文本在一个扩展更多按钮使用scrapy？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >