爬行wordreference时出现问题

import lxml.html as lh import urllib2 url = 'http://www.wordreference.com/es/translation.asp?tranword=crane' doc = lh.parse((urllib2.urlopen(url))) trans = doc.xpath('//td[@class="ToWrd"]/text()') for i in trans: print i

1条回答

网友

1楼 · 发布于 2024-06-12 08:19:39

看起来您需要发送一个User-Agent头，请参见Changing user agent on urllib2.urlopen。在

另外，只需切换到^{}就可以了（默认情况下，它会自动发送python-requests/version用户代理）：

import lxml.html as lh
import requests

url = 'http://www.wordreference.com/es/translation.asp?tranword=crane'

response = requests.get("http://www.wordreference.com/es/translation.asp?tranword=crane")
doc = lh.fromstring(response.content)

trans = doc.xpath('//td[@class="ToWrd"]/text()')
for i in trans:
    print(i)

印刷品：

^{pr2}$

编程相关推荐

java如何创建多个前台通知？
java会话无法在GAE服务器上运行，只能在本地运行
java组织。阿帕奇。吊索脚本编写。jsp。贾斯珀。JasperException:无法加载标记处理程序类
带负结果的java BigDecimal减法
JavaSpringWS请求GZIP压缩
java通过公共方法访问私有成员变量
java如何在Android Studio中改变listView中的项目数量？
java如何在Blackberry Eclipse中创建jar
java为什么在使用有界类型参数时需要强制转换
将arraylist中的数据转换为文本文件时出现java ConcurrentModificationException

相关问题更多 >

编程相关推荐

热门问题

热门文章

爬行wordreference时出现问题

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >