BS4 Python获得href网址

问题描述

我在bs4脚本上堆满了,我需要获取href链接或元内容,那怎么办?基本上我需要得到这个:

<Meta itemprop="image" content="https://resources.reed.co.uk/profileimages/logos/thumbs/logo_71709.png?v=20200828172950">

<img src="https://resources.reed.co.uk/profileimages/logos/thumbs/logo_71709.png?v=20200828172950" alt="Posted by Publica Group " width="120" height="50" class=" b-loaded" style="display: inline;">

我尝试用:

            logoscrap = soup.find('Meta',attrs={'itemprop': 'image'})

            logoscrap = soup.find('img',class_="b-loaded").attrs['src']

但是我的代码不起作用...

解决方法

soup.find返回dict对象,您可以直接从dict获取服装

img = soup.find('meta',attrs={'itemprop': 'image'})
logoscrap = img['content']
#output:
https://resources.reed.co.uk/profileimages/logos/thumbs/Logo_71709.png?v=20200828172950

img = soup.find('img',class_="b-loaded")
logoscrap  = img['src']
#output:
https://resources.reed.co.uk/profileimages/logos/thumbs/Logo_71709.png?v=20200828172950