Go to file
ywkj bff990628b fix: use localhost for CDP (IPv6), prevent null overwrite on dimensions
- CDP discovery uses localhost instead of 127.0.0.1 (Chrome binds IPv6)
- Only overwrite logistics fields when parsing succeeds, preventing
  later unparseable keys from nullifying valid parsed values

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-30 12:08:41 +08:00
.forgejo/workflows feat: initial skill template 2026-03-14 10:35:01 +08:00
scripts feat: default CDP port 18800, add 包装信息 and 商品件重尺 extraction 2026-03-30 08:20:46 +08:00
src fix: use localhost for CDP (IPv6), prevent null overwrite on dimensions 2026-03-30 12:08:41 +08:00
.gitignore feat: initial skill template 2026-03-14 10:35:01 +08:00
README.md feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00
SKILL.md feat: default CDP port 18800, add 包装信息 and 商品件重尺 extraction 2026-03-30 08:20:46 +08:00
install.sh fix: restore auth-cli.ts and install.sh for org CI registration 2026-03-30 08:00:56 +08:00
package.json feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00

README.md

1688-logistics-scraper

从 1688 商品页面提取物流相关数据(重量、尺寸、体积)。

通过 Chrome DevTools Protocol (CDP) 连接到已运行的 Chrome 浏览器自动提取商品属性、SKU 变体中的物流数据,并下载详情图片供进一步分析。

前置条件

启动 Chrome 并开启远程调试:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222

安装

bash install.sh

使用

bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'

数据来源

  1. 商品属性表(商品属性/商品参数)
  2. SKU/变体规格
  3. 物流信息区域
  4. 商品详情图片(下载到 /tmp/1688-logistics/<offer-id>/