Amazon deals API with Python

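Problem description

I wrote this code to find offers in the IT category on amazon.it, but the line containers = page_soup.findAll("div",{"class": "a-row dealContainer dealTile"}) does not find the HTML tags that are visible on the page.

Here is the Python code: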

    import csv
    from urllib.request import urlopen as uReq
    
    from bs4 import BeautifulSoup as soup
    
    
    def to_float(price_text):
        # Italian format: "." groups thousands and "," marks decimals, e.g. "1.234,56".
        return float(price_text.replace(".", "").replace(",", "."))
    
    
    # URL to scrape from; the page number goes between the two halves.
    first_url = "http://www.amazon.it/gp/goldbox/ref=gbps_ftr_s-5_9fdc_page_2?gb_f_deals1=page:"
    second_url = ",sortOrder:BY_SCORE,enforcedCategories:425916031,dealsPerPage:40&pf_rd_p=e3352cfd-6885-47c5-8c7b-040f48979fdc&pf_rd_s=slot-5&pf_rd_t=701&pf_rd_i=gb_main&pf_rd_m=A11IL2PNWYJU7H&pf_rd_r=72HHSM8FG394PDYPFGMZ&ie=UTF8"
    out_filename = "offers.csv"
    headers = ["product_name", "original_price", "offers_price", "discount"]
    
    # Open the file once, before the page loop, so pages do not overwrite each other.
    with open(out_filename, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(headers)
    
        for index in range(1, 9):
            page_url = first_url + str(index) + second_url
            uClient = uReq(page_url)
            page_soup = soup(uClient.read(), "html.parser")
            uClient.close()
    
            containers = page_soup.findAll("div",{"class": "a-row dealContainer dealTile"})
            print(containers)
    
            for container in containers:
                # select() returns a list, so index it before reading .text.
                spans = container.div.select("span")
                # Index 0 for the name is an assumption; the original gave no index.
                product_name = spans[0].text.strip()
                original_price = to_float(spans[3].text)
                offers_price = to_float(spans[1].text)
                discount = spans[4].text.strip()
                print("product_name: " + product_name + "\n")
                print("original_price: " + str(original_price) + "\n")
                print("offers_price: " + str(offers_price) + "\n")
                print("discount: " + discount + "\n")
                # csv.writer quotes fields, so a comma in a product name is safe.
                writer.writerow([product_name, original_price, offers_price, discount])
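
The price conversion in the loop above depends on the order of the replacements. Below is a minimal sketch of that step on its own. It assumes Italian formatting such as "1.234,56 €", where "." groups thousands and "," marks decimals. The helper name parse_eur_price and the sample strings are made up for illustration and do not come from the Amazon page.

```python
import re


def parse_eur_price(text):
    """Convert an Italian-formatted price string to a float.

    Hypothetical helper: assumes "." is the thousands separator and
    "," is the decimal separator, with optional currency text around it.
    """
    # Grab the numeric part, ignoring currency symbols and other text.
    match = re.search(r"\d[\d.]*(?:,\d+)?", text)
    if match is None:
        raise ValueError("no price found in " + repr(text))
    number = match.group(0)
    # Drop the thousands separators first, then turn the decimal comma into a dot.
    return float(number.replace(".", "").replace(",", "."))


if __name__ == "__main__":
    for sample in ["1.234,56 €", "EUR 19,99", "€ 5"]:
        print(sample, "->", parse_eur_price(sample))
```

Doing the replacements in the opposite order, as in the question, first turns the decimal comma into a dot and then deletes every dot, which loses the decimal point.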

Solution

No effective solution to this problem has been found yet.
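
Short of a fix, one diagnostic step is to check whether the deal tiles are present at all in the HTML that urlopen returns. A plausible cause, which the source does not confirm, is that the tiles are inserted by JavaScript after the page loads, so the fetched markup never contains them. The sketch below uses the standard-library html.parser instead of BeautifulSoup so that it runs without extra packages. The class DivClassCounter, the function count_deal_tiles and both HTML samples are hypothetical and only illustrate the check.

```python
from html.parser import HTMLParser


class DivClassCounter(HTMLParser):
    """Count <div> start tags that carry all of the required CSS classes."""

    def __init__(self, required_classes):
        super().__init__()
        self.required = set(required_classes)
        self.count = 0

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        # The class attribute is a space-separated list; compare as a set
        # so that the order of the classes does not matter.
        classes = set((dict(attrs).get("class") or "").split())
        if self.required <= classes:
            self.count += 1


def count_deal_tiles(html):
    counter = DivClassCounter(["a-row", "dealContainer", "dealTile"])
    counter.feed(html)
    counter.close()
    return counter.count


# Made-up sample: markup as a browser might show it after scripts have run.
RENDERED = (
    '<div id="widgetContent">'
    '<div class="a-row dealContainer dealTile"><div><span>Mouse</span></div></div>'
    "</div>"
)
# Made-up sample: markup as a plain HTTP fetch might return it.
RAW = '<div id="widgetContent"></div><script>/* tiles are added at runtime */</script>'

if __name__ == "__main__":
    print("rendered:", count_deal_tiles(RENDERED))
    print("raw:", count_deal_tiles(RAW))
```

Running count_deal_tiles on the decoded result of uClient.read() would show whether the class names occur in the fetched page. If the count is zero while the browser's inspector shows the tiles, the findAll call is probably not the part at fault.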
