靓汤：替换返回的图像源的一部分

import requests from bs4 import BeautifulSoup import os.path url = "https://example.net/g/1" i = 1 data = requests.get(url) soup = BeautifulSoup(data.text, 'html.parser') for sou in soup.findAll("div", {"class": "gallery"}): sou.decompose() containers = soup.find_all('img') title = soup.find('h1').text imgsrc = containers for imgs in imgsrc: if ".jpg" in imgs['src']: sauce = (imgs['src']) if sauce[:1] =="/": image = 'https:' + sauce else: image = sauce nametemp = imgs.get('alt') if nametemp is None: filename = str(i) i = i+1 print(image)

1条回答

网友

1楼 · 发布于 2024-06-16 13:11:55

你的代码很混乱，与你的问题无关。因此，假设您有一个名为thumbnails的URL列表：

thumbnails = [
    'https://t.example.net/galleries/9/1t.jpg',
    'https://t.example.net/galleries/9/2t.jpg',
    'https://t.example.net/galleries/9/3t.jpg',
]

然后，您可以在列表中使用regex replace来转换您想要的URL：

import re
images = [re.sub(r't(\.jpg)', r'\1', url) for url in thumbnails]

相关问题更多 >

编程相关推荐

热门问题

热门文章