如何使用beautifulsoup提取h1标签文本

问题描述

对于您提供的当前链接,您可以这样获得:

company = soup.select('h1.listing-name')[0].text.strip()
print(company)

输出

Astra Hôtel Vevey 4*sup

解决方法

我想了解如何使用漂亮的汤来提取其中包含许多其他标签的h1标签文本:

<h1 class="listing-name">

Hôtel Vevey

<span class="entry-feedbacks-summary-title-rating-stars-container bootstrap">

<span class="entry-feedbacks-summary-title-rating-stars entry-feedbacks-summary-title-rating-stars-empty" data-container=".entry-feedbacks-summary-title-rating-stars-container" data-content="Il n'y a pas encore d'avis de clients à propos de Astra Hôtel Vevey 4*sup. Cliquez pour évaluer." data-placement="right" data-toggle="popover" data-trigger="hover" data-original-title="" title="">

<a class="feedback-login-link entry-feedbacks-header-link" href="/auth/localch?origin=https%3A%2F%2Ftel.local.ch%2Ffr%2Fd%2FVevey%2F1800%2FHotel%2FAstra-Hotel-Vevey-4sup-SVGb8b5z-QdrzGTddmyAAg%3Fwhat%3DHotel%26where%3DVaud%2B%2528Canton%2529%23entry-feedbacks-bottom-rate-button"><span class="entry-feedback-rating-star">

<i class="icon-star-outline entry-feedback-rating-star-empty"></i>

</span>

<span class="entry-feedback-rating-star">

<i class="icon-star-outline entry-feedback-rating-star-empty"></i>

</span>

<span class="entry-feedback-rating-star">

<i class="icon-star-outline entry-feedback-rating-star-empty"></i>

</span>

<span class="entry-feedback-rating-star">

<i class="icon-star-outline entry-feedback-rating-star-empty"></i>

</span>

<span class="entry-feedback-rating-star">

<i class="icon-star-outline entry-feedback-rating-star-empty"></i>

</span>



</a></span>



</span>

</h1>

我正在尝试在h1标签“hôtelVevey”之后提取文本。

import requests
from bs4 import BeautifulSoup

url = "https://tel.local.ch/fr/d/Vevey/1800/Hotel/Astra-Hotel-Vevey-4sup-SVGb8b5z-QdrzGTddmyAAg?what=Hotel&where=Vaud+%28Canton%29"
get_url = requests.get(url)
get_text = get_url.text
soup = BeautifulSoup(get_text,"html.parser")

company = soup.find_next('h1','class:listing-name')


print(company)

它使我“无”