我有追随者
html_source = """{"linkparam":"CDAQ46598omxw=","linkmetadata":{"weblinkmetadata":{"url":"/service_ajax","sendPost":true}},"formfield":{"action":"CAUaMVVnd2t2Z1htRGl3OXAtS0FVaUY0QWFBQkNRLjhtZmduZEgzWXI4OG1maDFJMjRiV0gwATgAShUxMDIwMTQzMTg0NzMxMTE4NzMxNzBaGFVDQjBkMEpMbjFXY0dZY3d3Wjg3ZDJMQXAA","clientActions":[{"formaction":{"voteCount":{"accessibility":{"accessibilityData":{"label":"11 status"}},"simpleText":"11"},"formstatus":"FORM"}}]}}
#below part i want to extract from page including curly braces
{"linkparam":"CDAQ46597omxw=","linkmetadata":{"weblinkmetadata":{"url":"/service_ajax","sendPost":true}},"formfield":{"action":"CAUaMVVnd2t2Z1htRGl3OXAtS0FVaUY0QWFBQkNRLjhtZmduZEgzWXI4OG1maDFJMjRiV0gwATgAShUxMDIwMTQzMTg0NzMxMTE4NzMxNzBaGFVDQjBkMEpMbjFXY0dZY3d3Wjg3ZDJMQXAA","clientActions":[{"formaction":{"voteCount":{"accessibility":{"accessibilityData":{"label":"11 status"}},"simpleText":"11"},"formstatus":"FORM"}}]}}
#above part i want to extract from page including curly braces
{"linkparam":"CDAQ46448omxw=","linkmetadata":{"weblinkmetadata":{"url":"/service_ajax","sendPost":true}},"formfield":{"action":"BQkNRLjhtZmduZEgzWXI4OG1maDFJMjRiV0gwATgAShUxMDIwMTQzMTg0NzMxMTE4NzMxNzBaGFVDQjBkMEpMbjFXY0dZY3d3Wjg3ZDJMQXAA","clientActions":[{"formaction":{"voteCount":{"accessibility":{"accessibilityData":{"label":"11 status"}},"simpleText":"11"},"formstatus":"FORM"}}]}}"""a
m = re.search(r"\{(.*?)\}", html_source)
我想从页面字符串中提取这部分
{"linkparam":"CDAQ46597omxw=","linkmetadata":{"weblinkmetadata":{"url":"/service_ajax","sendPost":true}},"formfield":{"action":"CAUaMVVnd2t2Z1htRGl3OXAtS0FVaUY0QWFBQkNRLjhtZmduZEgzWXI4OG1maDFJMjRiV0gwATgAShUxMDIwMTQzMTg0NzMxMTE4NzMxNzBaGFVDQjBkMEpMbjFXY0dZY3d3Wjg3ZDJMQXAA","clientActions":[{"formaction":{"voteCount":{"accessibility":{"accessibilityData":{"label":"11 status"}},"simpleText":"11"},"formstatus":"FORM"}}]}}
您的数据看起来像是由注释分隔的json项的列表(以“#”开头的行)
因此,您可以用“,”替换注释,并用“[”和“]”包装数据,以创建一个json列表
然后,您可以使用json库来解析此项列表并提取第二个项:
你会得到:
如果您没有评论…
你可以做:
相关问题 更多 >
编程相关推荐