将JSON导入Pandas

{ "action" : "get", "application" : "4d97323f-ac0f-11e6-b1d4-0eec2415f3df", "params" : { "limit" : [ "2" ] }, "path" : "/businesses", "entities" : [ { "uuid" : "508d56f1-636b-11e7-9928-122e0737977d", "type" : "business", "size" : 730 }, { "uuid" : "2f3bd4dc-636b-11e7-b937-0ad881f403bf", "type" : "business", "size" : 730 } ], "timestamp" : 1499469891059, "duration" : 244, "count" : 2 }

3条回答

网友

1楼 · 编辑于 2024-04-20 00:00:56

使用my_json['entities']的方式使它看起来像是一个Python dict。

根据^{} documentation，read_json接受“有效的JSON字符串或类似文件”。可以使用以下命令将dict转换为json字符串：

import json
json_str = json.dumps(my_json["entities"])

正如您所描述的，键"entities"下的数据不适合orient="split"的格式化策略。看起来您需要使用orient="list"：

import pandas as pd

my_json = """{
    "entities": [
            {
                "type": "business",
                "uuid": "199bca3e-baf6-11e6-861b-0ad881f403bf",
                "size": 918
            },
            {
                "type": "business",
                "uuid": "054a7650-b36a-11e6-a734-122e0737977d",
                "size": 984
            }
        ]
}"""

print pd.read_json(my_json, orient='list')

屈服：

                                              entity
0  {u'type': u'business', u'uuid': u'199bca3e-baf...
1  {u'type': u'business', u'uuid': u'054a7650-b36...

或者

import pandas as pd

my_json = """[
    {
        "type": "business",
        "uuid": "199bca3e-baf6-11e6-861b-0ad881f403bf",
        "size": 918
    },
    {
        "type": "business",
        "uuid": "054a7650-b36a-11e6-a734-122e0737977d",
        "size": 984
    }
]"""

print pd.read_json(my_json, orient='list')

屈服：

   size      type                                  uuid
0   918  business  199bca3e-baf6-11e6-861b-0ad881f403bf
1   984  business  054a7650-b36a-11e6-a734-122e0737977d

网友

2楼 · 编辑于 2024-04-20 00:00:56

如果my_json如我所怀疑的那样是一本字典，那么您可以跳过pd.read_json，只需

pd.DataFrame(my_json['entities'])

   size      type                                  uuid
0   730  business  508d56f1-636b-11e7-9928-122e0737977d
1   730  business  2f3bd4dc-636b-11e7-b937-0ad881f403bf

网友

3楼 · 编辑于 2024-04-20 00:00:56

丹尼尔科林给我指出了正确的方向。最后我不得不：

pd.read_json(json.dumps(b_j['entities']) , orient='list')

read_json方法接受一个字符串，因此我转储entities集合并使用它。

相关问题更多 >

编程相关推荐

热门问题

热门文章