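I'm trying to scrape my stats from media.com and have built a bot that logs in. When I reach the stats page and try to print the story titles, it keeps dumping the entire HTML at me. The print call is only there to check that I'm getting the right content before I go any further: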
url = driver.page_source
headers = {"Accept-Language": "en-US, en;q=0.5"}
results = requests.get(url, headers=headers)
soup = BeautifulSoup(url, "lxml")
story_title = []
publication = []
views = []
reads = []
read_ratio = []
fans = []
stats_div = soup.find_all('tr', class_='sortableTable-row js-statsTableRow')
for container in stats_div:
    name = container.td.a.text.find('span', class_='sortableTable-title u-maxWidth450')
    story_title.append(name)
print(story_title)
Never mind, I figured it out! Selenium didn't like
url = driver.page_source
so I just used the link instead.
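
For anyone who lands here with the same symptom, here is a minimal sketch of how the title extraction could look once the page HTML is in hand. It is not the exact code I ended up with. The helper name extract_story_titles and the SAMPLE_HTML markup are made up for illustration, and the sample only assumes the class names from the question. It uses the built-in "html.parser" instead of "lxml" so that nothing beyond bs4 is needed. The key point is that find() is called on the row tag itself rather than on .text, which is a plain string.

```python
from bs4 import BeautifulSoup


def extract_story_titles(html):
    """Return the story titles found in the stats table rows of `html`.

    Hypothetical helper for illustration; the class names are taken from
    the question and may not match the live page.
    """
    soup = BeautifulSoup(html, "html.parser")
    titles = []
    # Match rows on a single class so extra classes or their order do not matter.
    for row in soup.find_all("tr", class_="js-statsTableRow"):
        # Call find() on the Tag, not on .text (which is a plain string).
        span = row.find("span", class_="sortableTable-title")
        if span is not None:
            titles.append(span.get_text(strip=True))
    return titles


# Made-up sample markup standing in for the real stats page.
SAMPLE_HTML = """
<table>
  <tr class="sortableTable-row js-statsTableRow">
    <td><a href="#"><span class="sortableTable-title u-maxWidth450">First story</span></a></td>
  </tr>
  <tr class="sortableTable-row js-statsTableRow">
    <td><a href="#"><span class="sortableTable-title u-maxWidth450">Second story</span></a></td>
  </tr>
</table>
"""

print(extract_story_titles(SAMPLE_HTML))  # ['First story', 'Second story']
```

With Selenium, the same helper could presumably be fed the rendered page after navigating to the stats link, for example extract_story_titles(driver.page_source). Note that driver.page_source is the HTML text itself, not a URL, so it should not be passed to requests.get.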