充满变数的靓汤

#Used to create file with open('departures.csv', mode='r') as csv_file: csv_reader = csv.DictReader(csv_file) for row in csv_reader: browser.get(row['link']) page = BeautifulSoup(browser.page_source, 'lxml') html = page.prettify() with open("output1.html", "w") as file: file.write(unicode(html)) #Code I want to Run right now it just returns an empty list position = page.find_all('span', class_= 'keyword')

<span class="keyword"> Account Manager</span> Small Piece of Actual HTML returned: <code id="profile-data" style="display: none;"> <!--{"breadcrumbs":{"customSearchURL":"/recruiter/smartsearch? updateSearchHistory=false&decorateHits=true&decorateFacets=false&doFacetCounting=true&searchHistoryId=3392867616&resetFacets=false&searchCacheKey=f4b1a865-50e8-4f59-ba48-9dff595e63e5%2CoUbi&searchRequestId=4d25da0f-1f73-4722-8586-9652b3f98b97%2CQSZO&doResultCaching=false&forceResultFromCache=false&origin=PPSL&doProjectBasedCounting=false&count=25&start=700","linkContext":"Controller:smartSearch,Action:search,ID:3392867616","context":

1条回答

网友

1楼 · 发布于 2024-05-14 13:47:57

LinkedIn使用大量JavaScript来生成您在浏览器中看到的页面。开发人员工具中的DOM元素检查器显示JS执行的当前结果，而不是浏览器下载的原始HTML页面。你知道吗

要在浏览器中查看实际的HTML页源，请使用“查看源”（Ctrl+U或Command+U）。它应该显示类似于Python的HTML。你知道吗

如果需要对最终生成的DOM输出执行一些刮取操作，则可能需要使用可以执行JavaScript（如Chrome controlled by Puppeteer）的headless browser。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章