如何使用python将base64编码的图像废弃并存储在数据库中

import urllib import urllib.request from bs4 import BeautifulSoup import base64 import os from string import ascii_lowercase def make_soup(url): thepage = urllib.request.urlopen(url) soupdata = BeautifulSoup(thepage, "html.parser") return soupdata soup = make_soup("https://mahabhunakasha.mahabhumi.gov.in/bhunaksha/27/index.jsp") soups = soup.decode('utf-8') print(soups) # for img in soup.findAll("src"): # print(img)

1条回答

网友

1楼 · 发布于 2024-04-27 03:50:23

为了避免我使用selenium时出现的SSL错误，您可以下载一个firefox exe并传递可执行路径，然后在安装selenium模块后，您就可以使用以下命令刮取页面：

#!pip install selenium

from bs4 import BeautifulSoup
from selenium import webdriver

driver = webdriver.Firefox(executable_path = 'C:\Gecko\geckodriver.exe')
thepage = driver.get("https://mahabhunakasha.mahabhumi.gov.in/bhunaksha/27/index.jsp")
html = driver.page_source
soup = BeautifulSoup(html,'lxml')

我没有看到这个页面上的任何图像，它似乎是一个画布类。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章