Python mechanize提示控件不存在
我正在用Python抓取一个需要密码的网站。我的代码如下:
import mechanize
import cookielib
from BeautifulSoup import BeautifulSoup
br = mechanize.Browser()
cj = cookielib.LWPCookieJar()
br.set_cookiejar(cj)
br.set_handle_equiv(True)
br.set_handle_gzip(True)
br.set_handle_redirect(True)
br.set_handle_referer(True)
br.set_handle_robots(False)
br.set_handle_refresh(mechanize._http.HTTPRefreshProcessor(), max_time=1)
br.addheaders = [('User-agent', 'Chrome')]
br.open('https://monitor1.returnpath.net/login.php')
for f in br.forms():
print f
br.select_form(nr=1)
br.form['email'] = 'email'
br.form['password'] = 'password'
这个循环返回了以下内容:
<form1 POST https://monitor1.returnpath.net/login.php application/x-www-form-urlencoded <TextControl(email=)> <PasswordControl(password=)> <CheckboxControl(remember=[1])> <SubmitControl(Submit=Sign In) (readonly)>> <GET http://now.eloqua.com/e/f2.aspx application/x-www-form-urlencoded <TextControl(e=)> <HiddenControl(lang=NA) (readonly)> <HiddenControl(elqSiteID=841) (readonly)> <HiddenControl(elqFormName=nLRegFooter-1347904420246) (readonly)> <SubmitControl(<None>=Sign Me Up) (readonly)> <SubmitButtonControl(<None>=) (readonly)>> <POST https://monitor1.returnpath.net/send_feedback.php application/x-www-form-urlencoded <HiddenControl(size=) (readonly)> <HiddenControl(nps=) (readonly)> <TextareaControl(desc=)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)> <IgnoreControl(<None>=<None>)>>
同时也出现了这个错误:
mechanize._form.ControlNotFoundError: 找不到名为 'email' 的控件
输出信息显示找到了 'email',所以我不明白为什么会说没有匹配的控件呢?
1 个回答
1
这个是从零开始计数的。你可以试试下面的代码:
br.select_form(nr=0)