问题描述
我在bs4脚本上堆满了,我需要获取href链接或元内容,那怎么办?基本上我需要得到这个:
<Meta itemprop="image" content="https://resources.reed.co.uk/profileimages/logos/thumbs/logo_71709.png?v=20200828172950">
或
<img src="https://resources.reed.co.uk/profileimages/logos/thumbs/logo_71709.png?v=20200828172950" alt="Posted by Publica Group " width="120" height="50" class=" b-loaded" style="display: inline;">
我尝试用:
logoscrap = soup.find('Meta',attrs={'itemprop': 'image'})
和
logoscrap = soup.find('img',class_="b-loaded").attrs['src']
但是我的代码不起作用...
解决方法
soup.find返回dict对象,您可以直接从dict获取服装
img = soup.find('meta',attrs={'itemprop': 'image'})
logoscrap = img['content']
#output:
https://resources.reed.co.uk/profileimages/logos/thumbs/Logo_71709.png?v=20200828172950
或
img = soup.find('img',class_="b-loaded")
logoscrap = img['src']
#output:
https://resources.reed.co.uk/profileimages/logos/thumbs/Logo_71709.png?v=20200828172950