擅长:python、mysql、java
<p>我的方法是:</p>
<pre><code>import requests
import demjson
from bs4 import BeautifulSoup
url = 'http://cinematreasures.org/theaters/united-states?page=1'
page = requests.get(url)
soup = BeautifulSoup(page.text)
to_plain_coord = lambda d: (d['point']['lng'], d['point']['lat'])
# Grabbing theater coords if `data` attribute exists
coords = [
to_plain_coord(demjson.decode(t.attrs['data']))
for t in soup.select('.theater')
if 'data' in t.attrs]
print(coords)
</code></pre>
<p>我不使用任何字符串操作。相反,我从<code>data</code>属性加载JSON。不幸的是,这里不是很有效的JSON,所以我使用<code>demjson</code>库进行JSON解析。在</p>
^{pr2}$