用BeautifulSoup刮网站

{'class': ['W(100%)', 'Va(t)', 'Px(0)'], 'data-reactid': '.odbtogw33w.0.0.$uh.2.0.1.0.1.0.0.0'} {'class': ['Va(t)', 'Tren(os)', 'W(10%)', 'Whs(nw)', 'Px(0)', 'Bdcl(s)'], 'data-reactid': '.odbtogw33w.0.0.$uh.2.0.1.0.1.0.0.1'}

2条回答

网友

1楼 · 编辑于 2024-05-16 06:46:20

所以这是我写的附加代码，现在我可以很好地保存动态生成的内容，并使用BeautifulGroup获得我想要的标记：

from contextlib import closing
from selenium.webdriver import Firefox
from selenium.webdriver.support.ui import WebDriverWait

with closing(Firefox()) as browser:
    browser.get('https://finance.yahoo.com/quote/IONS?p=IONS')
    button = browser.find_element_by_link_text('Statistics')
    button.click()
    #WebDriverWait(browser, timeout=10).until(
        #lambda x: x.find_element_by_class_name('Fz(s) Fw(500) Ta(end)'))
    page_source = browser.page_source
print(page_source)

yahoo_finance = BeautifulSoup(page_source, 'html.parser')

@neftes@Padraic Cunningham谢谢你的提示。在

网友

2楼 · 编辑于 2024-05-16 06:46:20

只需使用请求就可以获得数据，内容是从ajax get tohttps://query1.finance.yahoo.com/v10/finance/quoteSummary/IONS生成的：

from pprint import pprint as pp
import requests

params = {"formatted": "true", "lang": "en-US", "region": "US",
          "modules": "defaultKeyStatistics,financialData,calendarEvents", "corsDomain": "finance.yahoo.com"}

url = "http://finance.yahoo.com/quote/IONS/key-statistics?p=IONS"
ajax = "https://query1.finance.yahoo.com/v10/finance/quoteSummary/IONS"

with requests.Session() as s:
    cont = requests.get(url).content
    data = s.get(ajax, params=params).json()

    pp(data[u'quoteSummary']["result"])

这给了你：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

用BeautifulSoup刮网站

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >