Python - BeautifulSoup: getting a string from player_d

Posted 2024-04-25 12:29:18


I'm working on a simple project and have run into a problem. I want to get a string out of the player_data attribute of a div. Here is the div:

<div id="mediaplayer60597053" 
     player_data='{
      "id": "mediaplayer60597053",
      "ads": {
        "schedule": [{
          "enabled": true,
          "counter": false,
          "skip": true,
          "click": true,
          "key": "",
          "tag": "https:\/\/www.cda.pl\/xml.php?type=g_embed&get=pool&ts=1500453286",
          "repeat": 1,
          "time": 0,
          "type": "pool",
          "displayAs": "prerol"
        }]
      },
      "video": {
        "id": "60597053",
        "file": "http:\/\/vrbx072.cda.pl\/dYXEHM8Nw3y_TZTmTs4e0g\/1500496486\/vl9afb2190473cc908d0c33cdb15bb212994083ca30c797154058bc8717c4ca746.mp4",
        "manifest": null,
        "duration": "6115",
        "durationFull": "01:41:55",
        "poster": "\/\/static.cda.pl\/v001\/img\/mobile\/poster16x9.png",
        "type": "plain",
        "width": 1920,
        "height": 816,
        "content_rating": null,
        "quality": "vl",
        "ts": 1500453286,
        "hash": "26be0bc36e8575c32ff32f4329a301889d1f6f7a"
      },
      "nextVideo": null,
      "autoplay": false,
      "seekTo": 0,
      "premium": false,
      "api": {
        "client": "json_client",
        "ts": "1500453286_60686",
        "key": "9a3859a86e909430bd379badfa68d0d712603626",
        "method": ""
      },
      "user": {
        "role": "guest"
      }
    }' 
     tabindex="1">
</div>

I want to get this string:

"http:\/\/vrbx072.cda.pl\/dYXEHM8Nw3y_TZTmTs4e0g\/1500496486\/vl9afb2190473cc908d0c33cdb15bb212994083ca30c797154058bc8717c4ca746.mp4"

Thanks for your help.


1 answer
User
#1 · Posted 2024-04-25 12:29:18

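It looks like you need to get the div and then extract the JSON object from its player_data attribute. You can use soup.find to locate the div, then json.loads to convert the JSON string into a Python dictionary. json.loads also decodes the escaped slashes (\/) into plain slashes, so the result is a normal URL.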

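import json

# `soup` is the BeautifulSoup object already built from the page's HTML
div = soup.find('div', {'id' : 'mediaplayer60597053' })
# The player_data attribute holds a JSON string; parse it into a dict
data = json.loads(div['player_data'])

print(data['video']['file'])
# http://vrbx072.cda.pl/dYXEHM8Nw3y_TZTmTs4e0g/1500496486/vl9afb2190473cc908d0c33cdb15bb212994083ca30c797154058bc8717c4ca746.mp4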
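
If the number in the div id changes from page to page, hard-coding it may not work for you. Here is a minimal self-contained sketch of the same idea that looks the div up by the presence of the player_data attribute instead. The HTML string, the example.com URL and the helper name extract_video_file are made up for illustration; only soup.find and json.loads come from the answer above.

```python
import json

from bs4 import BeautifulSoup

# Shortened stand-in for the div in the question (hypothetical sample data).
HTML = r"""
<div id="mediaplayer60597053"
     player_data='{
       "id": "mediaplayer60597053",
       "video": {
         "id": "60597053",
         "file": "http:\/\/example.com\/clip.mp4"
       }
     }'
     tabindex="1">
</div>
"""


def extract_video_file(html):
    """Return video.file from the first div with a player_data attribute, or None."""
    soup = BeautifulSoup(html, "html.parser")
    # attrs={"player_data": True} matches any div that has the attribute,
    # whatever its value, so the numeric id is not hard-coded.
    div = soup.find("div", attrs={"player_data": True})
    if div is None:
        return None
    # The attribute value is a JSON string; json.loads turns "\/" into "/".
    data = json.loads(div["player_data"])
    # Guard against a missing or null "video" entry.
    return (data.get("video") or {}).get("file")


url = extract_video_file(HTML)
print(url)  # http://example.com/clip.mp4
```

Returning None when the div or the key is missing is just one choice; you could raise an exception instead if a missing player is an error in your project.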
