擅长:python、mysql、java
<p>有点冗长的方法。bs4.7.1标准</p>
<pre><code>import requests
import pandas as pd
url = 'https://data.mongabay.com/igapo/australia/postcodes/sydney-numeric.html'
r = requests.get(url, headers = {'User-Agent' : 'Mozilla/5.0'})
soup = bs(r.content, 'lxml')
codes = []
names = []
for line in soup.select_one('h1:contains("Postal codes") ~ font [size="2"]').text.split('\n'):
if line[:4]:
codes.append(line[:4]), names.append(line[5:])
df = pd.DataFrame(list(zip(codes, names)), columns = ['code', 'name'])
</code></pre>