从一个同时提到卧室数量的字符串中提取平方米的最佳方法是什么？

sqm = [] for item in soup.findAll('div', attrs={'class': 'xl-surface-ch'}): item = item.contents[0].strip()[0:4] item_clean = re.findall("[0-9]{2,4}", item) sqm.append(item_clean) print(sqm)

[['84'], ['70'], ['80'], ['32'], ['149'], ['22'], ['75'], ['30'], ['23'], ['104'], [], ['95'], ['129'], ['26'], ['55'], ['26'], ['25'], ['28'], ['33'], ['210'], ['37'], ['69'], ['36'], ['19'], ['119'], ['20'], ['20'], ['129'], ['154'], ['25']]

1条回答

网友

1楼 · 发布于 2024-04-25 13:39:48

import requests
from bs4 import BeautifulSoup

r = requests.get(
    'https://www.immoweb.be/en/search/apartment/for-sale/leuven/3000')
soup = BeautifulSoup(r.text, 'html.parser')

for item in soup.findAll('div', attrs={'class': 'xl-surface-ch'}):
    item = item.text.strip()
    if 'm²' in item:
        print(item[0:item.find('m')])
    else:
        item = 0
        print(item)

输出：

相关问题更多 >

编程相关推荐

热门问题

热门文章

从一个同时提到卧室数量的字符串中提取平方米的最佳方法是什么？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >