问题描述
我正在尝试向Louis Vuitton发送请求,但是我使用的所有库都无法获得响应。即使我可以从浏览器访问该站点。我在做错什么吗?
我已经使用过 requests , urllib3 和 BeautifulSoup ,但我对这些库的尝试均未成功。
代码:
from urllib.request import Request,urlopen
from bs4 import BeautifulSoup as soup
url = 'https://us.louisvuitton.com/eng-us/products/pocket-organizer-monogram-other-nvprod2380073v'
req = Request(url,headers={'User-Agent': 'Chrome/84.0.4147.105'})
webpage = urlopen(req).read()
page_soup = soup(webpage,"html.parser")
title = page_soup.find("title")
print(title)
输出: 不给我一个,一直呆到我KeyboardInterrupt。
注意: 该项目的目的是监视网站以检测是否发生任何更改。 (钱包有库存)
解决方法
您headers
不正确。
from urllib.request import Request,urlopen
from bs4 import BeautifulSoup as soup
HEADERS = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML,like Gecko)'
}
url = 'https://us.louisvuitton.com/eng-us/products/pocket-organizer-monogram-other-nvprod2380073v'
req = Request(url,headers=HEADERS)
webpage = urlopen(req).read()
page_soup = soup(webpage,"html.parser")
title = page_soup.find("title")
print(title.text)
输出:
Pocket Organizer Monogram Other in Blue - Small Leather Goods M69679 | LOUIS VUITTON ®