This is part of the JSON file I got as output after running a Python script that uses the Telethon API:
[{"_": "Message", "id": 4589, "to_id": {"_": "PeerChannel", "channel_id": 1399858792}, "date": "2020-09-03T14:51:03+00:00", "message": "Looking for product managers / engineers who have worked in search engine / query understanding space. Please PM me if you can connect me to someone for the same", "out": false, "mentioned": false, "media_unread": false, "silent": false, "post": false, "from_scheduled": false, "legacy": false, "edit_hide": false, "from_id": 356886523, "fwd_from": null, "via_bot_id": null, "reply_to_msg_id": null, "media": null, "reply_markup": null, "entities": [], "views": null, "edit_date": null, "post_author": null, "grouped_id": null, "restriction_reason": []}, {"_": "MessageService", "id": 4588, "to_id": {"_": "PeerChannel", "channel_id": 1399858792}, "date": "2020-09-03T11:48:18+00:00", "action": {"_": "MessageActionChatJoinedByLink", "inviter_id": 310378430}, "out": false, "mentioned": false, "media_unread": false, "silent": false, "post": false, "legacy": false, "from_id": 1264437394, "reply_to_msg_id": null}
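As you can see, the Python script has scraped the chat history from a specific Telegram channel. All I need is to store the date and message fields of the JSON in a separate DataFrame, so that I can apply the appropriate filters and produce the appropriate output. Can anyone help me?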
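If the data is still a JSON string (for example '[{...}, {...}]'), parse it first: data = json.loads(data). Then iterate over each dict in the list and use dict.get for each key you need. dict.get returns None if the key does not exist, so an entry ends up with None either when the key is missing or when 'message' is None.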
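The dict.get approach described above can be sketched roughly as follows. This is a minimal illustration, not the original answer's code: the raw string is a trimmed stand-in for the file contents, and dropping rows without a message is an assumed filtering step.

```python
import json

import pandas as pd

# Trimmed stand-in for the file contents (assumption: same shape as the question's data).
raw = """[
  {"_": "Message", "id": 4589, "date": "2020-09-03T14:51:03+00:00",
   "message": "Looking for product managers"},
  {"_": "MessageService", "id": 4588, "date": "2020-09-03T11:48:18+00:00"}
]"""

data = json.loads(raw)  # list of dicts

# dict.get returns None when a key is missing (e.g. MessageService has no "message").
rows = [{"date": d.get("date"), "message": d.get("message")} for d in data]

df = pd.DataFrame(rows)
df["date"] = pd.to_datetime(df["date"])  # parse ISO 8601 timestamps

# Assumed filter: keep only entries that actually carry a message.
df = df.dropna(subset=["message"]).reset_index(drop=True)
print(df)
```

With real data, the string would come from reading the file, or json.load could be applied to the open file object directly.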
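I think you should load the data with json and then use json_normalize to convert it into a DataFrame, with the nested dictionaries flattened into top-level columns.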
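A minimal sketch of the json_normalize suggestion, assuming pandas 1.0 or later (where it is available as pandas.json_normalize). The sample string is a trimmed stand-in for the question's data.

```python
import json

import pandas as pd

# Trimmed stand-in for the file contents (assumption: same shape as the question's data).
raw = """[
  {"_": "Message", "id": 4589,
   "to_id": {"_": "PeerChannel", "channel_id": 1399858792},
   "date": "2020-09-03T14:51:03+00:00",
   "message": "Looking for product managers"},
  {"_": "MessageService", "id": 4588,
   "to_id": {"_": "PeerChannel", "channel_id": 1399858792},
   "date": "2020-09-03T11:48:18+00:00",
   "action": {"_": "MessageActionChatJoinedByLink", "inviter_id": 310378430}}
]"""

records = json.loads(raw)

# Nested dicts are flattened into dotted column names such as "to_id.channel_id".
flat = pd.json_normalize(records)

# Keep only the two columns the question asks for; a missing "message" shows up as NaN.
subset = flat[["date", "message"]]
print(subset)
```

Because every key becomes a column, this route is convenient when more fields than date and message might be needed later.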