擅长:python、mysql、java
<p>如果要对站点进行爬网,请参见<a href="https://stackoverflow.com/questions/2667509/curl-alternative-in-python">this post</a>。如果您只想处理一些页面并分析其内容(意味着您知道要处理的url),请尝试<a href="http://www.crummy.com/software/BeautifulSoup/" rel="nofollow noreferrer">BeautifulSoup</a>,它允许您执行以下操作:</p>
<pre><code>page = urllib2.urlopen(url)
soup = BeautifulSoup(page.read())
for f in soup.findAll('form'):
target_url = f['action']
#do something with each one of the forms
</code></pre>