Python redirect error when trying to parse a web page

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("http://www.animeplus.tv/anime-show-list/")
content = html.read()
soup = BeautifulSoup(content)
print(soup.prettify())

1 answer

User

#1 · Posted 2024-04-25 14:25:53

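The trick here is that the page redirects to itself and sets a cookie in the response headers. That cookie matters: without it, you will not get the HTML you see in a browser.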

Here is a solution using requests: open the same page twice within the same session:

import requests
from bs4 import BeautifulSoup

url = "http://www.animeplus.tv/anime-show-list/"
session = requests.Session()  # a session keeps cookies between requests
session.get(url)  # first request: the server sets its cookie
response = session.get(url)  # second request: the cookie is sent back
soup = BeautifulSoup(response.content, "html.parser")  # name the parser explicitly
print(soup.title.text)  # prints: "Watch Anime | Anime Online | Free Anime | English Anime | Watch Anime Online - AnimePlus.tv"

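Alternatively, you can use mechanize, but when this answer was written it did not support Python 3, so the example below uses Python 2 syntax. Here is how it works: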

>>> import mechanize
>>> browser = mechanize.Browser()
>>> browser.open('http://www.animeplus.tv/anime-show-list/')
>>> print browser.response().read()
<!DOCTYPE html>
<html>
<head>
  <title>Watch Anime | Anime Online | Free Anime | English Anime | Watch Anime Online - AnimePlus.tv</title> 
...
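
If you would rather stay with urllib on Python 3, the same idea (keep the cookie the server sets and send it back on the redirect) can probably be expressed with the standard library alone. The following is only a minimal sketch: fetch_with_cookies is a hypothetical helper name, the timeout default is an arbitrary choice, and it assumes the site needs nothing beyond the cookie from the first response.

```python
import http.cookiejar
import urllib.request


def fetch_with_cookies(url, timeout=10):
    """Fetch url with a cookie-aware opener and return the body as bytes.

    Hypothetical helper, not part of urllib: the cookie jar stores any
    cookie set by a redirect response and sends it back when the
    redirect is followed.
    """
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(jar)
    )
    with opener.open(url, timeout=timeout) as response:
        return response.read()
```

Calling fetch_with_cookies("http://www.animeplus.tv/anime-show-list/") and passing the result to BeautifulSoup would then mirror the requests version above. Because the cookie is captured during the redirect itself, a single call may be enough here, though that depends on how the server behaves.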
