问题描述
我正在尝试使用 scrapy 从这个 HTML 文件中提取值:
<div class="property-secondary-vl__detail">
<div class="row">
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Properti </div>
<div class="property-secondary-vl__detail__value">
Rumah </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Iklan </div>
<div class="property-secondary-vl__detail__value">
Dijual </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Market </div>
<div class="property-secondary-vl__detail__value">
Secondary </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Umur Bangunan </div>
<div class="property-secondary-vl__detail__value">
Lama </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Harga </div>
<div class="property-secondary-vl__detail__value">
Rp1.800.000.000 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Sertifikasi </div>
<div class="property-secondary-vl__detail__value">
SHM </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Kamar Tidur </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Kamar Mandi </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Luas Tanah </div>
<div class="property-secondary-vl__detail__value">
104m² </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Luas Bangunan </div>
<div class="property-secondary-vl__detail__value">
125m² </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Jumlah Lantai </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Carports </div>
<div class="property-secondary-vl__detail__value">
2 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Daya Listrik </div>
<div class="property-secondary-vl__detail__value">
5500 watt </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Interior </div>
<div class="property-secondary-vl__detail__value">
Semi Furnished </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tahun Dibangun </div>
<div class="property-secondary-vl__detail__value">
2005 </div>
</div>
</div>
</div>
我想要“__detail__key”作为列标题,而“__detail__value”是值本身。
另一个问题是,在这个特定的房产列表网站上,每个列表都有不同数量的项目。
示例: 在此列表 https://www.99.co/id/properti/rumah-dijual-1-8mily-ciputat-timur-520409782 上,“__detail__key”有 15 列,而在此列表 https://www.99.co/id/properti/rumah-dijual-1-65mily-pamulang-410793202 中,有 16 列。
是否可以使用scrapy来做到这一点?
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)