ywkj 935cac3c61
register-skill-release / register (push) Successful in 13s
feat: extract window.context.result.data for structured logistics data
Poll window.context.result.data (up to 15s) for productPackInfo,
productTitle, productAttributes, and skuSelection. This provides
structured weight/size/volume data per-variant directly from 1688's
JS context — more reliable than vision-only extraction.

Screenshots still captured as fallback for data only in images.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-30 14:05:53 +08:00
.forgejo/workflows feat: initial skill template 2026-03-14 10:35:01 +08:00
scripts feat: default CDP port 18800, add packaging info (包装信息) and item weight/size (商品件重尺) extraction 2026-03-30 08:20:46 +08:00
src feat: extract window.context.result.data for structured logistics data 2026-03-30 14:05:53 +08:00
.gitignore feat: initial skill template 2026-03-14 10:35:01 +08:00
README.md feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00
SKILL.md feat: define structured JSON output schema for API consumption 2026-03-30 12:18:15 +08:00
install.sh fix: restore auth-cli.ts and install.sh for org CI registration 2026-03-30 08:00:56 +08:00
package.json feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00

README.md

1688-logistics-scraper

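Extracts logistics-related data (weight, dimensions, volume) from 1688 product pages.

Connects over the Chrome DevTools Protocol (CDP) to an already-running Chrome browser, automatically extracts logistics data from product attributes and SKU variants, and downloads the detail images for further analysis.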

Prerequisites

Start Chrome with remote debugging enabled:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222
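Before running the scraper, it can help to confirm that Chrome is actually listening for CDP connections. The sketch below queries Chrome's `/json/version` discovery endpoint, which a Chrome instance started with `--remote-debugging-port` serves over HTTP. The helper names `cdpVersionUrl` and `probeChrome` are illustrative and not part of this repository. The port is a parameter because the launch command above uses 9222, while the commit log for `scripts` mentions a default CDP port of 18800.

```typescript
// Subset of the JSON that Chrome returns from /json/version.
interface CdpVersionInfo {
  Browser: string;
  webSocketDebuggerUrl: string;
}

// Build the URL of Chrome's HTTP discovery endpoint.
// The defaults mirror the launch command above; they are assumptions, not the scraper's own settings.
function cdpVersionUrl(host: string = "127.0.0.1", port: number = 9222): string {
  return `http://${host}:${port}/json/version`;
}

// Resolve to the version info, or null if nothing answers on that port.
async function probeChrome(host?: string, port?: number): Promise<CdpVersionInfo | null> {
  try {
    const res = await fetch(cdpVersionUrl(host, port));
    if (!res.ok) return null;
    return (await res.json()) as CdpVersionInfo;
  } catch {
    // Connection refused: Chrome is not running or remote debugging is off.
    return null;
  }
}
```

Calling `probeChrome()` and receiving `null` suggests that Chrome was started without the debugging flag or on a different port.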

Installation

bash install.sh

Usage

bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'
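The URL passed to `scrape` carries the offer ID (`852504650877` in the example above), and the README gives `/tmp/1688-logistics/<offer-id>/` as the image download directory. The following is a minimal sketch of how the two could be related. `parseOfferId` and `imageDir` are hypothetical helpers, and the URL pattern is inferred only from the single example URL, so the scraper's real parsing may differ.

```typescript
// Hypothetical helper: extract the numeric offer ID from a 1688 detail URL.
// Returns null for malformed URLs or paths that do not match /offer/<digits>.html.
function parseOfferId(url: string): string | null {
  let pathname: string;
  try {
    pathname = new URL(url).pathname;
  } catch {
    return null; // not a parseable URL
  }
  const match = pathname.match(/^\/offer\/(\d+)\.html$/);
  return match?.[1] ?? null;
}

// Hypothetical helper: directory for downloaded detail images,
// following the /tmp/1688-logistics/<offer-id>/ pattern from the README.
function imageDir(offerId: string): string {
  return `/tmp/1688-logistics/${offerId}/`;
}
```

Validating the URL up front lets a wrapper script fail fast on a mistyped link before any browser connection is attempted.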

Data sources

  1. Product attribute table (商品属性 / 商品参数)
  2. SKU / variant specifications
  3. Logistics information section
  4. Product detail images (downloaded to /tmp/1688-logistics/<offer-id>/)
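In addition to the page regions listed above, the latest commit message describes polling `window.context.result.data` for up to 15 s to read `productPackInfo`, `productTitle`, `productAttributes`, and `skuSelection`. The sketch below shows one way such a poll could be structured. It is not the code in `src`: `pollFor`, the 500 ms interval, and the `LogisticsContext` interface are assumptions, and the field shapes are left as `unknown` because the source does not describe them.

```typescript
// Generic poll: call `read` until it yields a non-null value or the timeout elapses.
async function pollFor<T>(
  read: () => Promise<T | null | undefined>,
  timeoutMs: number = 15_000, // 15 s, as stated in the commit message
  intervalMs: number = 500,   // assumed interval; not given in the source
): Promise<T | null> {
  const deadline = Date.now() + timeoutMs;
  while (true) {
    const value = await read();
    if (value != null) return value;
    // Stop if the next sleep would run past the deadline.
    if (Date.now() + intervalMs > deadline) return null;
    await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
  }
}

// Field names come from the commit message; their shapes are unknown.
interface LogisticsContext {
  productPackInfo?: unknown;
  productTitle?: unknown;
  productAttributes?: unknown;
  skuSelection?: unknown;
}

// Expression a caller could evaluate in the page, for example through
// CDP's Runtime.evaluate with returnByValue set to true.
const CONTEXT_EXPRESSION = "window.context?.result?.data ?? null";
```

A caller would pass a `read` function that evaluates `CONTEXT_EXPRESSION` in the page and casts the result to `LogisticsContext`. A `null` result from `pollFor` is the signal to rely on the screenshot fallback.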