How to remove tags from BeautifulSoup results (e.g. Address = [a, b, c, d, r, ...])
from bs4 import BeautifulSoup as bs
import requests
#
url = 'https://www.planetware.com/tourist-attractions-/oslo-n-osl-oslo.htm'
url_get = requests.get(url)
soup = bs(url_get.content, 'html.parser')
#
address=soup.find_all('p', class_="nospc")
address
[<p class="nospc">Address: Nobels gate 32, N-0268 Oslo</p>,
<p class="nospc">Address: Akershus Festning, 0015 Oslo</p>,
<p class="nospc">Address: Frederiks gate 2, 0164 Oslo</p>,
<p class="nospc">Address: Universitetsgata 13, Oslo</p>,
<p class="nospc">Address: Tøyengata 53, 0578 Oslo</p>,
<p class="nospc">Address: Bellevue, Oslo</p>,
<p class="nospc">Address: Frederiks gate 2, 0164 Oslo</p>,
<p class="nospc">Address: Bygdøynesveien 39, 0286 Oslo</p>,
<p class="nospc">Address: Kongeveien 5, 0787 Oslo</p>,
<p class="nospc">Address: Karl Johansgt. 11, 0154 Oslo</p>,
<p class="nospc">Address: Rådhuset, 0037 Oslo</p>,
<p class="nospc">Address: Bryggegata 9, 0120 Oslo</p>,
<p class="nospc">Address: Sars gate 1, 0562 Oslo</p>,
<p class="nospc">Address: Kirsten Flagstads Plass 1, 0150 Oslo</p>]
I would like to get something like this:
Address = ['Nobels gate 32, N-0268 Oslo', 'Akershus Festning, 0015 Oslo' ...]
You can use the text attribute to get the content inside each tag, then split the string to keep only the address part.
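The answer's original code was not preserved, so the following is only a minimal sketch of the approach it describes. It assumes every paragraph starts with the label "Address:" and uses a few sample paragraphs copied from the question's output instead of fetching the live page. The names html, PREFIX and addresses are illustrative.

```python
from bs4 import BeautifulSoup

# Sample markup copied from the question's output so the sketch runs offline.
html = """
<p class="nospc">Address: Nobels gate 32, N-0268 Oslo</p>
<p class="nospc">Address: Akershus Festning, 0015 Oslo</p>
<p class="nospc">Address: Frederiks gate 2, 0164 Oslo</p>
"""

soup = BeautifulSoup(html, "html.parser")

PREFIX = "Address:"  # assumption: this label opens every paragraph

addresses = []
for p in soup.find_all("p", class_="nospc"):
    # .text returns the tag's content without the surrounding markup.
    # split(..., 1)[-1] keeps what follows the label, or the whole
    # string if the label is missing.
    addresses.append(p.text.split(PREFIX, 1)[-1].strip())

print(addresses)
# ['Nobels gate 32, N-0268 Oslo', 'Akershus Festning, 0015 Oslo', 'Frederiks gate 2, 0164 Oslo']
```

With the real page, the same loop should work on the soup object built from requests.get(url).content in the question, provided the page still uses the nospc class.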
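
The .text attribute can do this. However, you cannot call it on the list returned by find_all; you have to iterate over that list and read .text from each element.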
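To illustrate the point about iterating, here is a short sketch using two sample paragraphs from the question's output. It first shows that reading .text on the find_all result itself raises AttributeError, then collects the text tag by tag. The variable names are illustrative, and get_text(strip=True) is used here as an equivalent of .text that also trims surrounding whitespace.

```python
from bs4 import BeautifulSoup

# Two sample paragraphs taken from the question's output.
html = """
<p class="nospc">Address: Bellevue, Oslo</p>
<p class="nospc">Address: Sars gate 1, 0562 Oslo</p>
"""

soup = BeautifulSoup(html, "html.parser")
tags = soup.find_all("p", class_="nospc")

# The result of find_all is a list-like collection of tags, not a tag,
# so it has no .text of its own.
try:
    tags.text
    list_has_text = True
except AttributeError:
    list_has_text = False

# Read the text from each element instead.
texts = [tag.get_text(strip=True) for tag in tags]

print(list_has_text)  # False
print(texts)          # ['Address: Bellevue, Oslo', 'Address: Sars gate 1, 0562 Oslo']
```

The "Address:" label is still part of each string at this point; it can be removed afterwards with ordinary string operations such as split.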