Python是否更改ulsou URL的显示内容？

from BeautifulSoup import BeautifulSoup import urllib2 link = "https://www.cagematch.net/?id=8&nr=12&page=4" print link url = urllib2.urlopen(link) #Cagematch URL for PWG Events content = url.read() soup = BeautifulSoup(content) events = soup.findAll("tr", { "class" : "TRow" }) #Captures all event classes into a list, each event on site is separated by '<tr class="TRow">' for i in events[1:12]: #For each event, only searches over a years scope data = i.findAll("td", { "class" : "TCol TColSeparator"}) #Captures each class on an event into a list item, separated by "<td class="TCol TColSeparator>" date = data[0].text #Grabs Date of show, date of show is always first value of "data" list show = data[1].text #Grabs name of show, name of show is always second value of "data" list status = data[2].text #Grabs event type, if "Event (Card)" show hasn't occurred, if "Event" show has occurred. print date, show, status if status == "Event": #If event has occurred, get card data print "Event already taken place" link = 'https://cagematch.net/' + data[4].find("a", href=True)['href'] print content

1条回答

网友

1楼 · 发布于 2024-04-20 00:06:53

仅通过重新定义link变量不会触发页面内容的更改-您必须从新链接请求并下载页面：

link = 'https://cagematch.net/' + data[4].find("a", href=True)['href']
url = urllib2.urlopen(link) 
content = url.read()

其他注意事项：

您使用的是非常过时的BeautifulSoup版本3。更新到^{} 4：
```
pip install beautifulsoup4  upgrade
```
并将导入更改为：
```
from bs4 import BeautifulSoup
```
您可以通过切换到requests并对同一域的多个请求重用同一会话来提高性能
建议使用^{}连接URL的各个部分

相关问题更多 >

编程相关推荐

热门问题

热门文章