在jibberish中以漂亮的结果进行报废

<html> <head> </head> <body> }zƲu}y┴(M։ʖO┬┌;R° ─H$D◆P⎼^▒&▒└⎻;\␍␍ (Q│P]]]U]]U£œ␉NG/?5˶ض&±├;ӗ/D&▒└⎻;·GW5Q߶/..(ڧ?ڗV*V┘┌[;≥⎻^N0T4ۓ┐'┴┘S7׏; њ#─K

2条回答

网友

1楼 · 编辑于 2024-05-16 04:08:31

这对我来说很好

from bs4 import BeautifulSoup as bs4
import requests
import html5lib


def get_data():

    url = 'http://www.fdm.dk/bildatabasen/mazda/mazda3/15-100-hk/6-man-core-2017'
    r = requests.get(url, headers={"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/48.0.2564.103 Safari/537.36"})
    html_bytes = r.text
    soup = bs4(html_bytes, 'html5lib')

    res = soup.find("body")
    print(res.prettify())

    return res

test1 = get_data()

<body>
 <header id="header">
  <div id="logo-section-desktop">
   <div class="rowf">
    <div class="small-12 medium-3 large-3 columns">
     <a href="/" id="desktop-logo">
      FDM
     </a>
    </div>
    <div class="small-12 medium-9 large-9 columns">
     <ul class="top-navigation inline-list">
      <li>
       <a href="https://fdm.dk/alt-om-biler/vild-med-biler/motor">
        Motor
       </a>
      </li>
      ...

网友

2楼 · 编辑于 2024-05-16 04:08:31

你可能从网站上得到了压缩数据。像johnashu一样，使用requests library将自动为您解压。您可以手动执行此操作，但这是一个更难的问题

相关问题更多 >

编程相关推荐

热门问题

热门文章

在jibberish中以漂亮的结果进行报废

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >