Scrape text data from multiple links and store the scraped data in a CSV file

from selenium import webdriver
from selenium.webdriver.common.keys import Keys

# `cursor` is assumed to be an open database cursor created earlier (not shown in the question).
driver = webdriver.Chrome(r"C:\Users\Documents\chromedriver")

cursor.execute("select TOP (10) Url from TABLE_NAME ORDER BY ID DESC ")
# Fetch once: a second fetchall() on the same cursor would return an empty list.
result_data = cursor.fetchall()

for link in result_data:
    urllink = link.Url
    driver.get(urllink)

2 Answers

User

#1 · Edited 2024-04-25 22:20:42

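Create a DataFrame and append all the scraped data to it. Once scraping is finished, export it to a CSV file, which is a simple and reliable way to save tabular data.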
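As a rough illustration of that suggestion, here is a minimal sketch that collects rows in a list and exports them through a pandas DataFrame. The column names `url` and `text`, the sample rows, and the helper `rows_to_csv` are assumptions made up for this example; an in-memory buffer stands in for a real file path.

```python
import io

import pandas as pd


def rows_to_csv(rows, destination):
    """Build a DataFrame from a list of row dicts and write it out as CSV."""
    frame = pd.DataFrame(rows, columns=["url", "text"])
    # index=False keeps the DataFrame's row index out of the file.
    frame.to_csv(destination, index=False)
    return frame


# Hypothetical scraped rows; in practice these would be appended inside
# the scraping loop, one dict per visited link.
scraped_rows = [
    {"url": "https://example.com/a", "text": "first page text"},
    {"url": "https://example.com/b", "text": "second page, with a comma"},
]

buffer = io.StringIO()  # stand-in for a path such as "scraped.csv"
frame = rows_to_csv(scraped_rows, buffer)
csv_text = buffer.getvalue()
```

Building the DataFrame once from a list of dicts is usually simpler than growing a DataFrame row by row, and `to_csv` takes care of quoting fields that contain commas.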

User
#2 · Edited 2024-04-25 22:20:42

See the sample code layout below.
import scrapy
from selenium import webdriver
from selenium.webdriver.common.keys import Keys

# `cursor` is assumed to be an open database cursor created earlier.
cursor.execute("select TOP (10) Url from TABLE_NAME ORDER BY ID DESC ")
result_data = cursor.fetchall()

start_urls = []
for link in result_data:
    start_urls.append(link.Url)

class ToScrapeSpider(scrapy.Spider):
    name = 'toscrape'
    start_urls = start_urls

    def parse(self, response):
        # Write the code to scrape data here. If the URLs differ in structure, handle each case separately.
        # Also add the items to a pandas DataFrame; dataframe_name.to_csv('path/to/filename.csv') writes the file.
        pass
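The "different cases" idea in the comment above could look roughly like the sketch below, which picks an extraction function by hostname and writes the results with the standard-library `csv` module rather than pandas. The hostnames, the extractor functions, and the sample pages are all hypothetical; in a Scrapy spider, `page_text` would typically come from something like `response.text`.

```python
import csv
import io
from urllib.parse import urlparse


def extract_default(page_text):
    """Fallback: keep the whole text, trimmed."""
    return page_text.strip()


def extract_first_line(page_text):
    """Example of a site-specific rule: keep only the first line."""
    stripped = page_text.strip()
    return stripped.splitlines()[0] if stripped else ""


# Hypothetical mapping from hostname to the extractor used for that site.
EXTRACTORS = {"news.example.com": extract_first_line}


def extract_for(url, page_text):
    """Choose an extractor based on the URL's hostname and apply it."""
    host = urlparse(url).hostname or ""
    return EXTRACTORS.get(host, extract_default)(page_text)


def write_rows(rows, destination):
    """Write (url, text) rows to an open text file or buffer as CSV."""
    writer = csv.writer(destination)
    writer.writerow(["url", "text"])
    writer.writerows(rows)


# Made-up pages standing in for downloaded responses.
pages = [
    ("https://news.example.com/1", "Headline\nBody"),
    ("https://blog.example.com/2", "  Whole post  "),
]
rows = [(url, extract_for(url, text)) for url, text in pages]

buffer = io.StringIO()  # stand-in for open("scraped.csv", "w", newline="")
write_rows(rows, buffer)
output = buffer.getvalue()
```

Keeping the per-site rules in a dictionary means a new site only needs one more entry, and unknown hosts still fall back to the default extractor.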
