使用beautifulsoup进行抓取，并在多个页面上迭代，其中两个参数在u中更改

2024-04-16 23:09:34 发布

您现在位置：Python中文网/ 问答频道 /正文

2929

网友

男 | 程序猿一只，喜欢编程写python代码。

我想用beautifulsoup刮几页。但是，url中有两个参数正在更改

到目前为止，我一直在尝试这个代码，但运气不佳

from urllib.request import urlopen

base_url= "https://superstats.dk/"
     n = 8
      for i in range(1, n+1):
       if (i == 1):
       # handle first page
    response = urlopen(base_url)
    response = urlopen(base_url + "program?aar=201" % i)
    response_plus =urlopen(response + "%2F201" % i+1)
    data = response_plus.read()

这是我想要在几个页面上迭代的输出

 import requests 
 from bs4 import BeautifulSoup

  r = requests.get('https://superstats.dk/program?aar=2018%2F2019')
  bs=BeautifulSoup(r.content, "lxml")

  table_div=bs.find(id="content")
  rows = table_div.find_all('tr')
  for row in rows:
     cols=row.find_all('td')
     cols=[x.text.strip() for x in cols]
     print (cols)

Tags： in from https import url for base response

1条回答

网友

1楼 · 发布于 2024-04-16 23:09:34

使用format()函数更改两个参数的值

for i in range(1,9):
  url='https://superstats.dk/program?aar=201{}%2F201{}'.format(i,i+1)
  print(url)

希望这会有所帮助

使用beautifulsoup进行抓取，并在多个页面上迭代，其中两个参数在u中更改

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用beautifulsoup进行抓取，并在多个页面上迭代，其中两个参数在u中更改

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >