1688-logistics-scraper/README.md

33 lines
769 B
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# 1688-logistics-scraper
从 1688 商品页面提取物流相关数据(重量、尺寸、体积)。
通过 Chrome DevTools Protocol (CDP) 连接到已运行的 Chrome 浏览器自动提取商品属性、SKU 变体中的物流数据,并下载详情图片供进一步分析。
## 前置条件
启动 Chrome 并开启远程调试:
```bash
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222
```
## 安装
```bash
bash install.sh
```
## 使用
```bash
bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'
```
## 数据来源
1. 商品属性表(商品属性/商品参数)
2. SKU/变体规格
3. 物流信息区域
4. 商品详情图片(下载到 `/tmp/1688-logistics/<offer-id>/`