How to extract JSON data from a script tag using Python

import urllib2
from bs4 import BeautifulSoup
import re
import json

url = "https://www.exampleURL.com"
page = urllib2.urlopen(url)
soup = BeautifulSoup(page, 'html.parser')
scripts = soup.find_all('script')

for script in scripts:
    try:
        data = json.loads(script)
        print("Success")
    except Exception:
        print("Not Successful")

1 Answer

User

#1 · Posted 2024-04-24 21:18:22

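Before you try to parse the contents of a <script> tag as JSON, the data needs some preprocessing. In particular, you need to strip the __DATA__ = prefix that precedes the JavaScript dictionary.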
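The prefix-stripping step can be sketched as follows. This is a minimal sketch, not the answerer's own code: the helper name parse_script_json is hypothetical, and the handling of an optional var/let/const/window. prefix and a trailing semicolon is an assumption about how such pages are commonly written. Only the __DATA__ name comes from the answer.

```python
import json
import re


def parse_script_json(script_text, var_name="__DATA__"):
    """Strip a leading `<var_name> =` assignment and parse the rest as JSON.

    Returns None when the text is empty, does not start with that
    assignment, or the remainder is not valid JSON.
    (Hypothetical helper; the name and signature are illustrative.)
    """
    if not script_text:
        return None
    # Assumption: the assignment may be written as `__DATA__ = ...`,
    # `var __DATA__ = ...`, or `window.__DATA__ = ...`.
    pattern = (
        r"^\s*(?:(?:var|let|const)\s+|window\.)?"
        + re.escape(var_name)
        + r"\s*=\s*"
    )
    match = re.match(pattern, script_text)
    if match is None:
        return None
    # Keep only what follows the `=`, minus any trailing semicolon.
    payload = script_text[match.end():].strip().rstrip(";").rstrip()
    try:
        return json.loads(payload)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return None


print(parse_script_json('__DATA__ = {"hello": 2};'))
```

With the soup object from the question, each tag's text would be passed in as parse_script_json(script.string), since json.loads expects a string rather than a Tag object.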

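A few things to keep in mind:

A JavaScript dictionary is not necessarily a valid JSON blob. In particular, JSON requires every key to be a string wrapped in double quotes.

Example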

{hello: 2}   # Correct JavaScript, incorrect JSON - missing quotes around key
{'hello': 2} # Correct JavaScript, incorrect JSON - Quotes must be double quotes

{"hello": 2} # Correct JSON and JavaScript

Some things that may help with debugging

for script in scripts:
    try:
        print(script.string)  # See what you are trying to load
        # json.loads needs the tag's text, not the Tag object itself
        data = json.loads(script.string)
        print("Success")
    except Exception as e:
        print("Not Successful because {}".format(e))  # Print additional information
