Scraping a web page that keeps refreshing

import time
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
driver.get('http://oasis.caiso.com/mrioasis/logon.do')
PublicBids = driver.find_element(By.XPATH, '//*[@id="IMG_111854124"]')
PublicBids.click()
dates = ['04/18/2019']

def BidsScraper(d):
    # give the page time to load
    time.sleep(2)
    dateField = driver.find_element(By.XPATH, '//*[@id="TB_101685670"]')
    dateField.send_keys(d)
    DownloadCSV = driver.find_element(By.XPATH, '//*[@id="BTN_101685706"]')
    DownloadCSV.click()

2 Answers

User

#1 · Edited 2024-06-06 12:08:00

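Two things to try: force the page to stop refreshing, and click only when Selenium has actually found the element. If that still doesn't work for you, what I usually do is use a macro program such as AppRobotic Personal to move the mouse to the button's X/Y coordinates and simulate a mouse click there. Something like this, inside a try/except: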

import win32com.client
from selenium import webdriver
from selenium.webdriver.common.by import By

# AppRobotic macro API, used as a fallback for keyboard and mouse input
x = win32com.client.Dispatch("AppRobotic.API")

driver = webdriver.Chrome()
driver.get('http://oasis.caiso.com/mrioasis/logon.do')
PublicBids = driver.find_element(By.XPATH, '//*[@id="IMG_111854124"]')
PublicBids.click()
dates = ['04/18/2019']

def BidsScraper(d):
    # wait for loading
    x.Wait(2000)
    # forcefully stop page reload at this point
    driver.execute_script("window.stop();")
    try:
        dateField = driver.find_element(By.XPATH, '//*[@id="TB_101685670"]')
        dateField.send_keys(d)
        # find_elements returns a list, so an empty list means the button was not found
        buttons = driver.find_elements(By.XPATH, '//*[@id="BTN_101685706"]')
        # confirm that the button was found
        if len(buttons) > 0:
            buttons[0].click()
    except Exception:
        # Selenium failed: fall back to the macro tool
        x.Type(d)
        # use UI Item Explorer to find the X,Y coordinates of button
        x.MoveCursor(438, 435)
        # click on button
        x.MouseLeftClick
    x.Wait(2000)
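
The "click only when the element is found" idea can also be written as a small polling loop instead of a fixed wait. The following is a minimal sketch, not part of the original answer: `click_when_found`, its `timeout` and `poll` parameters, and the `find` callable are hypothetical names chosen for illustration. It deliberately does not import Selenium, so any zero-argument function that returns an object with a `click()` method will work.

```python
import time

def click_when_found(find, timeout=10.0, poll=0.5):
    """Retry find() and click() until both succeed or the timeout expires.

    find    -- zero-argument callable returning an object with .click()
               (hypothetical; e.g. a lambda wrapping driver.find_element)
    timeout -- maximum number of seconds to keep retrying
    poll    -- seconds to sleep between attempts
    Returns True if the click went through, False on timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            element = find()
            element.click()
            return True
        except Exception:
            # The page may still be refreshing: the lookup or the click can
            # fail, so retry until the deadline passes.
            if time.monotonic() >= deadline:
                return False
            time.sleep(poll)
```

With the driver from the answer above, a call might look like `click_when_found(lambda: driver.find_element(By.XPATH, '//*[@id="BTN_101685706"]'))`. Selenium also ships its own `WebDriverWait` with `expected_conditions.element_to_be_clickable`, which covers the same need and is probably the better choice when the page is driven through Selenium only.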

User

#2 · Edited 2024-06-06 12:08:00

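One way to solve this is to locate the required element or button relative to some static Id, instead of going straight to the element's dynamic Id.

I don't know the exact XPath, but, for example, the div wrapping the date input has the Id PFC_Public_Bids_date_from, so you could try the following:

dateField = driver.find_element(By.XPATH, '//*[@id="PFC_Public_Bids_date_from"]//input')

Similarly, the button might be something like: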

DownloadCSV = driver.find_element(By.XPATH, '//*[@id="CsvExportButton"]//button')
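
To see why anchoring on a static Id helps, here is a small offline sketch using only the standard library. The two markup snapshots are hypothetical and heavily simplified (the real page structure is not known); they only assume, as described above, that the wrapper div keeps the Id PFC_Public_Bids_date_from while the input's own Id changes between page loads. `find_ids` is an illustrative helper, not a Selenium call.

```python
import xml.etree.ElementTree as ET

# Hypothetical snapshots of the same page on two loads: the wrapper Id is
# static, the input Id is regenerated (TB_999999999 is a made-up value).
SNAPSHOT_A = '<body><div id="PFC_Public_Bids_date_from"><input id="TB_101685670" /></div></body>'
SNAPSHOT_B = '<body><div id="PFC_Public_Bids_date_from"><input id="TB_999999999" /></div></body>'

# Locator anchored on the static wrapper Id.
RELATIVE_XPATH = ".//*[@id='PFC_Public_Bids_date_from']//input"
# Locator that targets the dynamic Id directly.
DYNAMIC_XPATH = ".//*[@id='TB_101685670']"

def find_ids(markup, xpath):
    """Return the id attribute of every element matching xpath in markup."""
    root = ET.fromstring(markup)
    return [element.get("id") for element in root.findall(xpath)]

for name, snapshot in (("A", SNAPSHOT_A), ("B", SNAPSHOT_B)):
    print(name, "relative:", find_ids(snapshot, RELATIVE_XPATH))
    print(name, "dynamic: ", find_ids(snapshot, DYNAMIC_XPATH))
```

The relative locator matches the input in both snapshots, while the locator built on the dynamic Id finds nothing in the second one. ElementTree supports only a subset of XPath, so this is a demonstration of the idea rather than a substitute for testing the locator in the browser.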
