从BSE网站提取数据

from bs4 import BeautifulSoup import requests URL = 'https://www.bseindia.com/stock-share-price/smartlink-network-systems-ltd/smartlink/532419/' r = requests.get(URL) soup = BeautifulSoup(r.content, 'html5lib') table = soup.find('div', attrs = {'id':'newheaddivgrey'}) print(table)

from selenium import webdriver driver = webdriver.Chrome(r"C:\Program Files\JetBrains\PyCharm Community Edition 2017.3.3\bin\chromedriver.exe") driver.get('https://www.bseindia.com/stock-share-price/smartlink-network-systems-ltd/smartlink/532419/') table=driver.find_elements_by_xpath('//*[@id="SecuritywiseDeliveryPosition"]/table/tbody/tr/td/table/tbody/tr[1]/td') print(table) driver.quit()

1条回答

网友

1楼 · 发布于 2024-05-14 12:59:42

在用Selenium加载页面后，可以使用driver.page_source获得Javascript修改的页面源代码。然后您可以在beauthoulGroup对象中传递这个页面源代码。在

driver = webdriver.Chrome()
driver.get('https://www.bseindia.com/stock-share-price/smartlink-network-systems-ltd/smartlink/532419/')
html = driver.page_source
driver.quit()

soup = BeautifulSoup(html, 'lxml')
table = soup.find('div', id='SecuritywiseDeliveryPosition')

这段代码将为您提供table变量中的Securitywise Delivery Position表。然后，您可以解析这个BeautifulSoup对象以获得所需的不同值。在

soup对象包含包含动态添加的元素的整页源。现在，您可以解析它来获得您提到的所有内容。在

相关问题更多 >

编程相关推荐

热门问题

热门文章