Continue running a Selenium script after the error "Failed to decode response from marionette"

from pyvirtualdisplay import Display
from time import sleep
import sys

# Python 2 only: forces UTF-8 as the default string encoding
reload(sys)
sys.setdefaultencoding('utf-8')

from selenium import webdriver
from selenium.common.exceptions import TimeoutException
from selenium.webdriver.firefox.options import Options

# Run Firefox inside a virtual (headless) display
display = Display(visible=0, size=(800, 600))
display.start()

urlsFile = open("urls.txt", "r")
urls = urlsFile.readlines()

driver = webdriver.Firefox(executable_path='/usr/local/lib/geckodriver/geckodriver')
driver.set_page_load_timeout(60)

for url in urls:
  try:
    driver.get(url)
    sleep(0.8)
    print(driver.title)
  except TimeoutException as e:
    print("Timeout")

1 Answer

Answer 1 · Posted 2024-04-20 03:52:38

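Note: this is my first attempt at writing Python.

You just need to build a method that retries the GET when it fails. You will still want to give up after a certain number of retries, but this should at least catch one-off failures for each URL.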

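from selenium.common.exceptions import TimeoutException

def retryable_get(self, url, max_tries=5):
  attempts = 0
  while attempts < max_tries:
    try:
      self.get(url)
      return  # success: stop retrying
    except Exception:
      print('An error occurred performing a GET to ' + url)
    finally:
      attempts += 1  # count every attempt, successful or not
  # Every attempt failed, so give up on this URL
  raise TimeoutException(f'Failed to GET {url} after {max_tries} attempts')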

You can call it like this:

retryable_get(driver, url)

Or, if you want a more object-oriented approach, patch the method onto the Firefox class:

webdriver.Firefox.retryable_get = retryable_get

for url in urls:
  try:
    driver.retryable_get(url)
    sleep(0.8)
    print(driver.title)
  except TimeoutException as e:
    print("Timeout")
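
The retry logic above can also be tried out without a browser. The following is a minimal sketch, assuming a hypothetical `FlakyDriver` stand-in whose `get` fails a fixed number of times before succeeding. The names `get_with_retries` and `delay`, and the use of `RuntimeError`, are illustrative choices and not part of Selenium or of the answer's code.

```python
import time


class FlakyDriver:
    """Hypothetical stand-in for a WebDriver: get() fails a set number of times."""

    def __init__(self, failures_before_success):
        self.remaining_failures = failures_before_success
        self.calls = 0

    def get(self, url):
        self.calls += 1
        if self.remaining_failures > 0:
            self.remaining_failures -= 1
            raise RuntimeError("simulated one-off failure for " + url)


def get_with_retries(driver, url, max_tries=5, delay=0.0):
    """Call driver.get(url) until it succeeds or max_tries attempts are used.

    Returns the number of attempts made. On give-up, raises RuntimeError
    chained to the last error seen.
    """
    last_error = None
    for attempt in range(1, max_tries + 1):
        try:
            driver.get(url)
            return attempt
        except Exception as error:  # broad on purpose, as in the answer above
            last_error = error
            time.sleep(delay)  # assumed pause between attempts; 0 disables it
    raise RuntimeError(f"Failed to GET {url} after {max_tries} attempts") from last_error
```

With a real driver, it would probably be safer to catch a narrower exception type than `Exception`, so that unrelated bugs are not silently retried.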
