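<p>Normally I would read the last page number from the site and then loop over every page, but this site does not let me retrieve the last page number.
So the loop below uses a page count that I found by checking the last page by hand.
Extracting the table itself is very simple with pandas.read_html.</p>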
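<pre><code>from io import StringIO

import pandas as pd
import requests

# Search-results URL (TESLA warrants); the page number is appended below.
url = "https://www.boursorama.com/bourse/produits-de-bourse/levier/warrants/resultats?\
warrant_filter%5Bnature%5D=1&\
warrant_filter%5BunderlyingType%5D=&\
warrant_filter%5BunderlyingName%5D=TESLA&\
warrant_filter%5Bmaturity%5D=0&\
warrant_filter%5BdeltaMin%5D=&\
warrant_filter%5BdeltaMax%5D=&\
warrant_filter%5Bissuer%5D=&\
warrant_filter%5Bsearch%5D="

frames = []
# 19 is the page count found by checking the last page by hand.
for i in range(19):
    # The "&" separator is needed before the page parameter.
    r = requests.get(url + '&page={}'.format(i))
    # read_html returns every table on the page; the first one is used here.
    # Newer pandas versions expect a file-like object rather than a raw HTML string.
    df_list = pd.read_html(StringIO(r.text))
    frames.append(df_list[0])

# Stack all pages into a single DataFrame.
res = pd.concat(frames, ignore_index=True)
</code></pre>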
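<p>If you would rather not hard-code the page count, one possible approach is to keep requesting pages until one comes back without a table or repeats the previous page. The sketch below shows only that stopping logic. The helper name <code>collect_pages</code> and its parameters are hypothetical, and how this particular site behaves past its last page is an assumption, so the demonstration runs against a fake in-memory site rather than the real one.</p>

```python
from typing import Callable, List, Optional, TypeVar

T = TypeVar("T")


def collect_pages(
    fetch_page: Callable[[int], Optional[T]],
    same: Callable[[T, T], bool],
    start: int = 1,
    max_pages: int = 100,
) -> List[T]:
    """Fetch pages start, start+1, ... until one is missing or repeats the previous one.

    fetch_page and same are hypothetical hooks supplied by the caller.
    """
    pages: List[T] = []
    for page in range(start, start + max_pages):
        current = fetch_page(page)
        if current is None:
            break  # nothing on this page: we are past the end
        if pages and same(pages[-1], current):
            break  # the site served the last page again
        pages.append(current)
    return pages


# Offline demonstration: a fake three-page site that repeats its last page
# for any page number beyond 3.
fake_site = {1: ["a", "b"], 2: ["c", "d"], 3: ["e"]}
demo = collect_pages(lambda n: fake_site.get(min(n, 3)), lambda x, y: x == y)
print(demo)
```

<p>With pandas, <code>fetch_page</code> could wrap <code>requests.get</code> and <code>pd.read_html</code>, returning <code>None</code> when <code>read_html</code> raises <code>ValueError</code> because no table was found, and <code>same</code> could be <code>lambda a, b: a.equals(b)</code>. The collected frames can then be passed to <code>pd.concat</code> as before. The <code>max_pages</code> cap is there so the loop still terminates if neither stopping condition ever triggers.</p>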