从需要时间加载的网页中抓取数据时出现问题

from bs4 import BeautifulSoup as bs from requests import get html = get("https://www.cbn.gov.ng/rates/ExchRateByCurrency.asp").text html = bs(html,"lxml") html = html.find("div",id="ContentTextinner")

2条回答

网友

1楼 · 编辑于 2024-06-16 12:15:46

您可以使用像selenium这样的库来实现这一点

例如：

from selenium import webdriver
from bs4 import BeautifulSoup as bs

driver = webdriver.Firefox()
driver.get("https://www.cbn.gov.ng/rates/ExchRateByCurrency.asp")

html = driver.page_source
print(html.find("div",id="ContentTextinner"))

driver.quit()

网友

2楼 · 编辑于 2024-06-16 12:15:46

我想这就是你想要的：

import requests as r

res = r.get('https://www.cbn.gov.ng/rates/outputExchangeRateJSN.asp')
if res.status_code == 200:
    data = res.json()
    # Do something with the data
else:
    print(f"Error: {res.status_code}")

您将以JSON的形式获取数据，并从中提取您需要的内容

这是因为请求是动态发出的，以填充页面的主体，这就是为什么您无法从第一个页面中找到内容

您也可以使用此链接将其作为CSV文件下载，所有内容：CSV_File

相关问题更多 >

编程相关推荐

热门问题

热门文章

从需要时间加载的网页中抓取数据时出现问题

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >