如何在靓汤中刮经纬度

网友

1楼 · 编辑于 2024-05-19 02:54:00

如果您只希望得到一个响应，请执行以下操作：

print links[0]

网友

2楼 · 编辑于 2024-05-19 02:54:00

好的，所以您正确地获取了所有<tr>，现在我们只需要从它们中获取data属性。在

import re
import requests
from bs4 import BeautifulSoup

url = 'http://cinematreasures.org/theaters/united-states?page=1' 
r = requests.get(url)
soup = BeautifulSoup(r.text, "html.parser")
theaters = soup.findAll("tr", class_="theater")
data = [ t.get('data') for t in theaters if t.get('data') ]
print data

不幸的是，这给了您一个字符串列表，而不是一个人们可能希望的dictionary对象。我们可以在数据字符串上使用正则表达式将其转换为dict（谢谢RootTwo）：

^{pr2}$

网友

3楼 · 编辑于 2024-05-19 02:54:00

我的方法是：

import requests
import demjson
from bs4 import BeautifulSoup

url = 'http://cinematreasures.org/theaters/united-states?page=1'
page = requests.get(url)
soup = BeautifulSoup(page.text)

to_plain_coord = lambda d: (d['point']['lng'], d['point']['lat'])
# Grabbing theater coords if `data` attribute exists
coords = [
    to_plain_coord(demjson.decode(t.attrs['data']))
    for t in soup.select('.theater')
    if 'data' in t.attrs]

print(coords)

我不使用任何字符串操作。相反，我从data属性加载JSON。不幸的是，这里不是很有效的JSON，所以我使用demjson库进行JSON解析。在

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在靓汤中刮经纬度

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >