无法使用BeautifulSoup访问<source>标记的['src']属性

import requests from bs4 import BeautifulSoup gyfyUrl = 'https://gfycat.com/PoshDearAsianporcupine' # creating a response object r = requests.get(gyfyUrl) # creating beautiful soup object soup = BeautifulSoup(r.content,'html5lib') # finding source tags in page sourceTags = soup.findAll('source') #printing found tags for clarity print(sourceTags) # printing src attribute within source tags - Error for tag in sourceTags: print(tag['src'])

1条回答

网友

1楼 · 发布于 2024-05-29 08:25:06

这里的问题是不是每个source标记都有src属性，在本例中，第一个标记没有。您可以使用如下条件列表理解来收集所有src属性（如果存在）：

srcs = [tag["src"] for tag in sourceTags if "src" in tag.attrs]

结果：

['https://giant.gfycat.com/PoshDearAsianporcupine.webm', 'https://giant.gfycat.com/PoshDearAsianporcupine.mp4', 'https://thumbs.gfycat.com/PoshDearAsianporcupine-mobile.mp4']

编程相关推荐

java如何在Rxjava中更改列表时通知obsever
java如何验证spring MVC web app中是否设置了连接池？
从Textview选择文本时出现安卓错误（java.lang.IndexOutOfBoundsException:setSpan（1…1）在0之前开始）
javakotlin：作为方法参数的接口
java将列强制转换为hibernate条件中的类型
java如何在屏幕上获取输出对象？
java内部调用方法
java Log4j2模式布局+转换模式处的负数
java将EditText转换为浮动安卓 eclipse
对Java继承规则感到困惑

相关问题更多 >

编程相关推荐

热门问题

热门文章