我试图从下面的URL中提取化学名称(全部大写)
https://www.legislation.gov.au/Details/F2020L01255
我对附表4所示的化学品感兴趣
import requests
import re
from requests import get
from bs4 import BeautifulSoup
import pandas as pd
import numpy as np
url = 'https://www.legislation.gov.au/Details/F2020L01255'
headers = {"Accept-Language": "EN-AU, en;q=0.5"}
results = requests.get(url, headers=headers)
soup = BeautifulSoup(results.text, "html.parser")
chemicals = []
chems_div = soup.find_all('div', class_='WordSection7')
我被困在这里了。化学名称用class='MsoNormal'和lang='EN-AU'包裹在P标签和Span标签周围
试试这个:
输出:
相关问题 更多 >
编程相关推荐