从动态HTML选项卡中提取所有数据

2条回答

网友

1楼 · 编辑于 2024-04-23 09:15:26

使用xpath而不是td/IDs提取行，因为它们不是常量。你知道吗

单击next page按钮，然后再次提取行，直到next page按钮Click给出NotFoundException（取决于按钮是否在最后一页上不可见）。如果你提供的HTML或网站链接，你会得到一个更好的答案。你知道吗

网友

2楼 · 编辑于 2024-04-23 09:15:26

经过大量测试，答案如下：

 try:
        last_row = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]")
        last_row_old = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]").text
        last_row.click()
        last_row.send_keys(Keys.PAGE_DOWN)
        time.sleep(2)
        last_row_new = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]").text

        while (last_row_new == last_row_old) is False:
            table = driver.find_element_by_xpath("//*[contains(@id, ' TilesTable-table')]/tbody")
            td_list = table.find_elements_by_xpath(".//tr/*[contains(@id, '-col1')]")
            for td in td_list:
                tile_title = td.text
                sh_tile = wb["Tuiles"]
                sh_tile.append([catalog, tile_title])
            last_row = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]")
            last_row_old = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]").text
            last_row.click()
            last_row.send_keys(Keys.PAGE_DOWN)
            time.sleep(0.5)
            last_row_new = driver.find_element_by_xpath(".//tr/*[contains(@id, ' TilesTable-rows-row19-col1')]").text
    except selenium.common.exceptions.NoSuchElementException:
        pass

相关问题更多 >

编程相关推荐

热门问题

热门文章

从动态HTML选项卡中提取所有数据

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >