我成功地刮这个网站和数据刮正确。唯一的问题是导出到csv。我用熊猫导出数据,结果弄得乱七八糟。下面是我的代码:
while next_page is not None:
results_element = driver.find_elements_by_xpath('//*[contains(concat( " ", @class, " " ), concat( " ", '
'"label-primary", " " ))]')
results = [x.text for x in results_element]
print(results)
driver.implicitly_wait(5)
ASIN_element = driver.find_elements_by_xpath(
'//*[contains(concat( " ", @class, " " ), concat( " ", "asin-column", '
'" " ))]//a')
ASIN = [x.text for x in ASIN_element]
print(ASIN)
driver.implicitly_wait(5)
Title_element = driver.find_elements_by_css_selector('.asin-column+ td')
Title = [x.text for x in Title_element]
print(Title)
driver.implicitly_wait(5)
Date_element = driver.find_elements_by_css_selector(
'.format-date'), 10
Date = [x for x in Date_element]
print(Date)
driver.implicitly_wait(5)
df = pd.DataFrame(list(zip(results, ASIN, Title, Date)), columns=['results', 'ASIN', 'Product_Title', 'Date'])
beach_balls_data = df.to_csv(f, index=False)
if next_page is not None:
driver.find_element_by_css_selector('.next a').click()
driver.implicitly_wait(5)
elif next_page is None:
iterate = False
driver.implicitly_wait(5)
time.sleep(5)
我只需要正确导出数据而不覆盖任何内容。任何帮助都将不胜感激。你知道吗
下方(不使用熊猫或任何其他图书馆)
输出('输出.csv')
相关问题 更多 >
编程相关推荐