从雅虎财经刮取数据

import requests from lxml import html xp = "//span[text()='Sector']/following-sibling::span[1]" symbol = 'AAPL' url = 'https://finance.yahoo.com/quote/' + symbol + '/profile?p=' + symbol page = requests.get(url) tree = html.fromstring(page.content) d = {}

3条回答

网友

1楼 · 编辑于 2024-04-26 04:52:29

看看这是否适合您：

xpp = tree.xpath('//div[@data-reactid=7]/p/text()[3]')[0].strip()
xpp

输出：

'United States'

网友

2楼 · 编辑于 2024-04-26 04:52:29

也许您可以将BeautifulSoup与正则表达式搜索结合使用，以筛选出位置：

import requests
from lxml import html
from bs4 import BeautifulSoup
import re

xp = "//span[text()='Sector']/following-sibling::span[1]"
symbol = 'TEVA'
url = 'https://finance.yahoo.com/quote/' + symbol + '/profile?p=' + symbol

page = requests.get(url)
soup = BeautifulSoup(page.content, 'html.parser')
baseTag = soup.findAll('p', {'class':"D(ib) W(47.727%) Pend(40px)"})
matches = re.findall("\  >(.*?)\<! ", str(baseTag))
print(matches[-1])

我用谷歌（Google）、苹果（Apple）和特瓦制药工业有限公司（Teva Pharmaceutical Industries Limited）对其进行了测试，结果似乎有效

网友

3楼 · 编辑于 2024-04-26 04:52:29

不要刮，而是使用yfinance，它会定期更新并简化一切：

import yfinance as yf
df = yf.download('TWTR')

如果要绘制它，请执行以下操作：

import finplot as fplt
fplt.candlestick_ochl(df[['Open','Close','High','Low']])
fplt.show()

相关问题更多 >

编程相关推荐

热门问题

热门文章