Scraping E*TRADE with Python requests does not work with a cross-domain URL

import requests
from bs4 import BeautifulSoup, Comment

symbol = 'A'
# etradeUsername and etradePassword must be defined before this point
payload = {'USER': etradeUsername,
           'PASSWORD': etradePassword,
           'countrylangselect': 'us_english',
           'TARGET': '/e/t/pfm/portfolioview'}

with requests.Session() as c:
    c.post('https://us.etrade.com/login.fcc', data=payload)
    r = c.get('https://us.etrade.com/e/t/pfm/portfolioview')
    #r = c.get('https://www.etrade.wallst.com/v1/stocks/snapshot/snapshot.asp?symbol=' + symbol + '&rsO=new')
    etradeMarkup = BeautifulSoup(r.text, 'html.parser')
    #print(r.headers)

    # write the prettified markup to disk as UTF-8 text
    with open("etrade.html", "w", encoding="utf-8") as file1:
        file1.write("<html><head><meta charset='UTF-8'></head><body>" + etradeMarkup.prettify() + "</body></html>")

1 answer

User

#1 · Posted 2024-05-15 00:24:55

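I found that a different page was responsible for setting the cookies. I had assumed that being bounced to the E*TRADE login page meant I needed cookies from the logged-in part of E*TRADE, but I was wrong. I did not need to log in to E*TRADE for this page at all; I only needed to visit another page to obtain the cookies. By adding a line that requests https://us.etrade.com/e/t/invest/markets?ploc=c-MainNav first, I was able to get the data needed to view the target page without my program being forced back to the login page.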

with requests.Session() as c:
    # adding this line was the key: visiting this page sets the required cookies
    c.get('https://us.etrade.com/e/t/invest/markets?ploc=c-MainNav')
    r = c.get('https://www.etrade.wallst.com/v1/stocks/snapshot/snapshot.asp?symbol=' + symbol + '&rsO=new')
    etradeMarkup = BeautifulSoup(r.text, 'html.parser')
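
The same idea can be wrapped in a small helper that also reports when the request ended up on a login page anyway. This is only a sketch: the function name `fetch_after_priming` and the `login_marker` check are assumptions made for illustration (the check simply looks for the word "login" in the final URL path, which is a guess and not documented E*TRADE behaviour). It relies only on the `get` method of a session and the `url` attribute of the response, both of which `requests` provides.

```python
from urllib.parse import urlparse


def fetch_after_priming(session, priming_url, target_url, login_marker="login"):
    """Visit priming_url so the session collects its cookies, then fetch target_url.

    Raises RuntimeError if the final URL path contains login_marker.
    The marker is a hypothetical heuristic, not part of any E*TRADE API.
    """
    # The priming response body is ignored; only the cookies it sets matter.
    session.get(priming_url)

    # The session now sends those cookies along with the real request.
    response = session.get(target_url)

    # response.url is the final URL after any redirects were followed.
    final_path = urlparse(response.url).path.lower()
    if login_marker in final_path:
        raise RuntimeError("redirected to a login page: " + response.url)
    return response
```

With a real `requests.Session()` named `c`, as in the snippet above, the call would be `fetch_after_priming(c, 'https://us.etrade.com/e/t/invest/markets?ploc=c-MainNav', target)`, where `target` is the snapshot URL built from `symbol`. Raising an error instead of returning the login page makes a missing cookie visible immediately rather than at parse time.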
