在python中请求超时

问题描述

我正在尝试向Louis Vuitton发送请求,但是我使用的所有库都无法获得响应。即使我可以从浏览器访问该站点。我在做错什么吗?

我已经使用过 requests urllib3 BeautifulSoup ,但我对这些库的尝试均未成功。

代码:

from urllib.request import Request,urlopen
from bs4 import BeautifulSoup as soup


url = 'https://us.louisvuitton.com/eng-us/products/pocket-organizer-monogram-other-nvprod2380073v'
req = Request(url,headers={'User-Agent': 'Chrome/84.0.4147.105'})

webpage = urlopen(req).read()
page_soup = soup(webpage,"html.parser")
title = page_soup.find("title")
print(title)

输出: 不给我一个,一直呆到我KeyboardInterrupt。

注意: 该项目的目的是监视网站以检测是否发生任何更改。 (钱包有库存)

解决方法

headers不正确。

from urllib.request import Request,urlopen
from bs4 import BeautifulSoup as soup

HEADERS = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML,like Gecko)'
}

url = 'https://us.louisvuitton.com/eng-us/products/pocket-organizer-monogram-other-nvprod2380073v'
req = Request(url,headers=HEADERS)

webpage = urlopen(req).read()
page_soup = soup(webpage,"html.parser")
title = page_soup.find("title")
print(title.text)

输出:

Pocket Organizer Monogram Other in Blue - Small Leather Goods M69679 | LOUIS VUITTON ®

相关问答

依赖报错 idea导入项目后依赖报错,解决方案:https://blog....
错误1:代码生成器依赖和mybatis依赖冲突 启动项目时报错如下...
错误1:gradle项目控制台输出为乱码 # 解决方案:https://bl...
错误还原:在查询的过程中,传入的workType为0时,该条件不起...
报错如下,gcc版本太低 ^ server.c:5346:31: 错误:‘struct...