How do I scrape the result names from a website?

Posted 2024-06-12 01:30:42


I want to scrape the result names from the following website: RAJASTHAN TECHNICAL UNIVERSITY, KOTA

Here is my current Python code:

import requests
from bs4 import BeautifulSoup

headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36'}

# url must hold the address of the results page (it is not defined in this post)
response = requests.get(url, headers=headers)

soup = BeautifulSoup(response.text, "lxml")

print(soup)

When I try to print soup, I get the following error:

Couldn't access the inner elements of <frame name="mainFrame" src="mainpage.php"/>

1 answer
User
#1 · Posted 2024-06-12 01:30:42

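The results are loaded inside a frame, so the outer page you requested contains none of the data. Instead, start from the frame's src (mainpage.php), collect the hidden form fields it contains, and send them in a POST request that mirrors what the page itself does to load the results: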

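import requests
from bs4 import BeautifulSoup as bs

with requests.Session() as s:
    # Send the results page as the Referer on every request in this session.
    s.headers = {'Referer': 'http://www.esuvidha.info/rtURes/ResMainpage.php'}
    # Fetch the frame's source page directly instead of the outer frameset.
    r = s.get('http://www.esuvidha.info/mainpage.php')
    soup = bs(r.content, 'lxml')
    # Collect every hidden input that has an id as a POST parameter.
    data = {i['id']: i['value'] for i in soup.select('[type="hidden"][id]')}
    # Replay the POST request the page makes to load the results.
    r = s.post('http://www.esuvidha.info/rtURes/ResMainpage.php', data=data)
    soup = bs(r.content, 'lxml')
    # Print the text of the first <font> that is a direct child of a <fieldset>.
    print(soup.select_one('fieldset > font').text)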
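If you would rather not hard-code the frame address, the frame's src can be read from the outer page and resolved against that page's URL. The following is only a rough sketch of that idea, not part of the code above: FrameFinder and frame_urls are hypothetical helper names, the sample markup is modelled on the frame tag quoted in the question, and the base URL http://www.esuvidha.info/ is an assumption taken from the URLs used in this answer. It uses only the standard library so that it runs offline; with BeautifulSoup the equivalent lookup would be something like soup.select('frame[src], iframe[src]').

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FrameFinder(HTMLParser):
    """Collect the src attribute of every <frame> and <iframe> tag."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag in ("frame", "iframe"):
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def frame_urls(html, base_url):
    """Return absolute URLs for all frames found in html."""
    finder = FrameFinder()
    finder.feed(html)
    # Frame sources are usually relative, so resolve them against the page URL.
    return [urljoin(base_url, src) for src in finder.sources]


# Hypothetical outer page, modelled on the frame tag quoted in the question.
sample = '<frameset><frame name="mainFrame" src="mainpage.php"/></frameset>'
# Assumed base URL; in practice pass response.url of the outer page.
print(frame_urls(sample, "http://www.esuvidha.info/"))
```

Each URL returned this way could then be fetched with the same session and parsed for hidden inputs, as in the code above.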
