Web data scraping with Python

from urllib.request import urlopen
from bs4 import BeautifulSoup

url = 'http://money.rediff.com/companies/Bajaj-Auto-Ltd/10540026'
data = urlopen(url)
soup = BeautifulSoup(data)
te = soup.find('a', attrs={'target': '_jbpinter'})
lis = te.find_all_next('a', attrs={'target': '_jbpinter'})
#print(lis)
for li in lis:
    print(li.find('a').contents[0])

1 answer

User

#1 · Posted 2024-05-16 14:02:23

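You are trying to fetch the a tag twice. Each li in lis is already an a tag, so li.find('a') looks for another a nested inside it and finds nothing.

Replace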

for li in lis:
    print(li.find('a').contents[0])

with

for li in lis:
    print(li.contents[0])

You should then get output like this:

Need Different Rates For Different Products: Rahul Bajaj on GST
Reforms irrespective of Bihar results: Bajaj
Auto shares in focus; Tata Motors up over 5%
We believe new Avenger will stimulate the market: Bajaj Auto's Eric Vas
BHP Billiton pins future of Indonesian coal mine on new...
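To make the point above concrete without hitting the live site, here is a minimal sketch that runs against a small inline HTML snippet. The snippet, the SAMPLE_HTML name and the headline_texts helper are made up for illustration and are not the real page markup. It also uses find_all on the whole document rather than find followed by find_all_next, because find_all_next only returns tags that come after the first match and therefore skips that first link; treat this as one possible variant, not the only way to do it.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page (an assumption for this sketch).
SAMPLE_HTML = """
<div>
  <a target="_jbpinter" href="/news/1">First headline</a>
  <a target="_jbpinter" href="/news/2">Second headline</a>
  <a target="_other" href="/x">Unrelated link</a>
</div>
"""


def headline_texts(html):
    """Return the text of every <a target="_jbpinter"> link found in html."""
    # Naming the parser explicitly avoids the "no parser specified" warning.
    soup = BeautifulSoup(html, "html.parser")
    links = soup.find_all("a", attrs={"target": "_jbpinter"})
    # Each item is already an <a> tag, so read its text directly.
    return [link.get_text(strip=True) for link in links]


# Why the original loop failed: a matched <a> has no nested <a> inside it,
# so find('a') returns None and .contents then raises AttributeError.
soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
first = soup.find("a", attrs={"target": "_jbpinter"})
nested = first.find("a")

for text in headline_texts(SAMPLE_HTML):
    print(text)
```

For the real page you would pass the response from urlopen(url) to BeautifulSoup in place of the sample string, assuming the links there still carry target="_jbpinter".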
