Unable to print data retrieved with the regex module (strictly limited to that module) in Python?

import urllib
import re

symbolslist = ["appl","spy","goog","nflx"]

i=0
while i<len(symbolslist):
    url ="http://in.finance.yahoo.com/q?s=" +symbolslist[i] +"&ql=1"
    htmlfile = urllib.urlopen(url)
    htmltext = htmlfile.read()
    regex ='(.+?)'
    pattern = re.compile(regex)
    print regex
    price = re.findall(pattern,htmltext)
    print "price of ",symbolslist[i],"is",price
    i+=1

(.+?)
price of appl is []
(.+?)
price of spy is []
(.+?)
price of goog is []
(.+?)
price of nflx is []

1 Answer

User

#1 · Posted on 2024-04-26 03:03:59

As an alternative approach, you may find it easier to use the yahoo_finance library, as follows:

from yahoo_finance import Share

for symbol in ["appl", "spy", "goog", "nflx"]:
    yahoo = Share(symbol)
    print 'Price of {} is {}'.format(symbol, yahoo.get_price())

This gives the following output:

Price of appl is 96.11
Price of spy is 186.63
Price of goog is 682.40
Price of nflx is 87.40

Trying to parse HTML data with regular expressions is never a good idea.
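To make that point concrete, here is a minimal sketch in Python 3 using only the standard library. The two HTML snippets, the `yfs_l84_goog` id, and the helper names `SpanTextParser` and `span_text` are invented for illustration and are not taken from Yahoo's actual pages. It shows a regex written against one rendering of a tag silently returning nothing when the attributes are quoted or ordered differently, while a real parser still finds the element.

```python
import re
from html.parser import HTMLParser

# Two hypothetical renderings of the same element; a browser treats them alike.
HTML_A = '<span id="yfs_l84_goog">682.40</span>'
HTML_B = "<span class='price' id='yfs_l84_goog' >682.40</span>"

# A regex written against the first rendering only.
NAIVE = re.compile(r'<span id="yfs_l84_goog">(.+?)</span>')


class SpanTextParser(HTMLParser):
    """Collect the text inside every <span> whose id equals target_id.

    Kept deliberately simple: nested <span> elements are not handled.
    """

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.inside = False
        self.found = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs, independent of quoting or order.
        if tag == "span" and dict(attrs).get("id") == self.target_id:
            self.inside = True

    def handle_endtag(self, tag):
        if tag == "span":
            self.inside = False

    def handle_data(self, data):
        if self.inside:
            self.found.append(data.strip())


def span_text(html, target_id):
    """Return the text of all matching spans in html."""
    parser = SpanTextParser(target_id)
    parser.feed(html)
    parser.close()
    return parser.found


if __name__ == "__main__":
    for html in (HTML_A, HTML_B):
        print(NAIVE.findall(html), span_text(html, "yfs_l84_goog"))
```

The regex matches only `HTML_A`, whereas the parser-based helper returns the price for both snippets. An empty list from `re.findall`, as in the question, is the typical symptom of the page markup not matching the pattern character for character.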

Another approach is to first extract the information with BeautifulSoup:

from bs4 import BeautifulSoup
import requests
import re

for symbol in ["appl", "spy", "goog", "nflx"]:
    url = 'http://finance.yahoo.com/q?s={}'.format(symbol)
    r = requests.get(url)
    soup = BeautifulSoup(r.text, "html.parser")

    data = soup.find('span', attrs= {'id' : re.compile(r'yfs_.*?_{}'.format(symbol.lower()))})
    print 'Price of {} is {}'.format(symbol, data.text)
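The `id` filter in the code above is itself a regular expression, so it can help to check what it accepts before pointing it at a live page. The following is a small Python 3 sketch under stated assumptions: the candidate id strings are made up (only the `yfs_..._<symbol>` shape comes from the answer), `id_pattern` is a hypothetical helper name, and the `re.escape` call is an addition that the answer's code does not have.

```python
import re


def id_pattern(symbol):
    """Build the id regex used in the answer for a given ticker symbol."""
    # re.escape is an addition here: it keeps a symbol such as "brk.b"
    # literal instead of letting "." match any character.
    return re.compile(r'yfs_.*?_{}'.format(re.escape(symbol.lower())))


pattern = id_pattern("GOOG")

# Hypothetical id values, invented for this example.
candidate_ids = ["yfs_l84_goog", "yfs_c63_goog", "yfs_l84_spy", "quote_header"]

# Keep every id in which the pattern is found.
matching = [value for value in candidate_ids if pattern.search(value)]

if __name__ == "__main__":
    print(matching)
```

In this sketch two of the ids match, which is worth keeping in mind: `soup.find` returns only the first matching element in document order, and it returns `None` when nothing matches, in which case `data.text` raises an `AttributeError`. Checking `data is not None` before printing avoids that.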
