Go to file
ywkj a780041840 feat: define structured JSON output schema for API consumption
SKILL.md now specifies exact JSON structure the model must output
after reading screenshots. Weight in kg, dimensions in cm, omit nulls.
Ready for downstream API integration.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-30 12:18:15 +08:00
.forgejo/workflows feat: initial skill template 2026-03-14 10:35:01 +08:00
scripts feat: default CDP port 18800, add 包装信息 and 商品件重尺 extraction 2026-03-30 08:20:46 +08:00
src refactor: replace DOM parsing with vision-based approach 2026-03-30 12:11:24 +08:00
.gitignore feat: initial skill template 2026-03-14 10:35:01 +08:00
README.md feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00
SKILL.md feat: define structured JSON output schema for API consumption 2026-03-30 12:18:15 +08:00
install.sh fix: restore auth-cli.ts and install.sh for org CI registration 2026-03-30 08:00:56 +08:00
package.json feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00

README.md

1688-logistics-scraper

从 1688 商品页面提取物流相关数据(重量、尺寸、体积)。

通过 Chrome DevTools Protocol (CDP) 连接到已运行的 Chrome 浏览器自动提取商品属性、SKU 变体中的物流数据,并下载详情图片供进一步分析。

前置条件

启动 Chrome 并开启远程调试:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222

安装

bash install.sh

使用

bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'

数据来源

  1. 商品属性表(商品属性/商品参数)
  2. SKU/变体规格
  3. 物流信息区域
  4. 商品详情图片(下载到 /tmp/1688-logistics/<offer-id>/