ywkj 20d3529068
fix: use Page.loadEventFired + networkIdle instead of fixed timeout
Replace 15s polling loop with proper CDP event-based page load
detection: wait for Page.loadEventFired, then PerformanceObserver
network idle (no new resource requests for 1s). More reliable and
faster than fixed timeouts.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-31 06:51:51 +08:00
.forgejo/workflows feat: initial skill template 2026-03-14 10:35:01 +08:00
scripts feat: default CDP port 18800, add 包装信息 (packaging info) and 商品件重尺 (item weight and dimensions) extraction 2026-03-30 08:20:46 +08:00
src fix: use Page.loadEventFired + networkIdle instead of fixed timeout 2026-03-31 06:51:51 +08:00
.gitignore feat: initial skill template 2026-03-14 10:35:01 +08:00
README.md feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00
SKILL.md feat: define structured JSON output schema for API consumption 2026-03-30 12:18:15 +08:00
install.sh fix: restore auth-cli.ts and install.sh for org CI registration 2026-03-30 08:00:56 +08:00
package.json feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00

README.md

1688-logistics-scraper

Extracts logistics-related data (weight, dimensions, volume) from 1688 product pages.

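Connects over the Chrome DevTools Protocol (CDP) to an already-running Chrome browser, automatically extracts logistics data from product attributes and SKU variants, and downloads the detail images for further analysis.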
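
The latest commit message describes page-load detection as waiting for Page.loadEventFired and then for network idle, meaning no new resource requests for 1s. The quiet-window bookkeeping behind that idea might look roughly like the sketch below. The class name `NetworkIdleTracker` and its methods are hypothetical and not taken from the repository; how resource events reach `recordActivity` (for example from a PerformanceObserver callback) is left out and is an assumption.

```typescript
// Hypothetical helper: remembers when the last resource request was seen
// and reports whether the quiet window has fully elapsed since then.
class NetworkIdleTracker {
  private readonly quietMs: number;
  private lastActivityMs: number;

  // quietMs: required length of the quiet window (the commit message mentions 1s).
  // startMs: timestamp at which tracking begins, e.g. when the load event fired.
  constructor(quietMs: number, startMs: number) {
    this.quietMs = quietMs;
    this.lastActivityMs = startMs;
  }

  // Call once per observed resource request; out-of-order timestamps are ignored.
  recordActivity(nowMs: number): void {
    this.lastActivityMs = Math.max(this.lastActivityMs, nowMs);
  }

  // True once at least quietMs have passed since the most recent activity.
  isIdle(nowMs: number): boolean {
    return nowMs - this.lastActivityMs >= this.quietMs;
  }
}
```

A caller would typically poll `isIdle(Date.now())` on a short interval after the load event and proceed with extraction once it returns true. Taking timestamps as arguments keeps the logic testable without a browser.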

Prerequisites

Start Chrome with remote debugging enabled:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222
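
Once Chrome is running with a debugging port, it serves a list of debuggable targets over HTTP at `/json/list`, and each page target carries a `webSocketDebuggerUrl` for CDP clients. The sketch below shows one way a script could locate a tab to attach to. The helper names `pickPageTarget` and `findTarget` are hypothetical and are not this project's actual code. The default port 9222 mirrors the command above; the commit log mentions a script default of 18800, so the port may need adjusting.

```typescript
// Subset of the fields Chrome returns for each entry of /json/list.
interface CdpTarget {
  type: string;
  url: string;
  webSocketDebuggerUrl?: string;
}

// Hypothetical helper: choose a debuggable page target,
// preferring a tab that is already on a 1688 page.
function pickPageTarget(targets: CdpTarget[]): CdpTarget | undefined {
  const pages = targets.filter(
    (t) => t.type === "page" && Boolean(t.webSocketDebuggerUrl),
  );
  return pages.find((t) => t.url.includes("1688.com")) ?? pages[0];
}

// Hypothetical helper: query the debugging port and return the chosen target.
// Assumes Chrome is reachable on 127.0.0.1 at the given port.
async function findTarget(port: number = 9222): Promise<CdpTarget | undefined> {
  const res = await fetch(`http://127.0.0.1:${port}/json/list`);
  if (!res.ok) {
    throw new Error(`CDP endpoint returned HTTP ${res.status}`);
  }
  return pickPageTarget((await res.json()) as CdpTarget[]);
}
```

If `findTarget()` resolves to `undefined`, no page tab is open; if the request itself fails, Chrome is most likely not running with `--remote-debugging-port` on that port.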

Installation

bash install.sh

Usage

bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'

Data sources

  1. Product attribute table (商品属性 / 商品参数)
  2. SKU / variant specifications
  3. Logistics information section
  4. Product detail images (downloaded to /tmp/1688-logistics/<offer-id>/)
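
The image directory above is keyed by the offer ID, which appears as the numeric segment of the product URL used in the usage example. A minimal sketch of that mapping follows. The function names `parseOfferId` and `imageDirFor` are hypothetical, and the URL pattern is inferred only from the single example URL in this README, so the real scraper may accept other forms.

```typescript
// Hypothetical helper: extract the numeric offer ID from a 1688 detail URL.
// Returns null for anything that does not match /offer/<digits>.html on detail.1688.com.
function parseOfferId(rawUrl: string): string | null {
  let url: URL;
  try {
    url = new URL(rawUrl);
  } catch {
    return null; // not a parseable absolute URL
  }
  if (url.hostname !== "detail.1688.com") {
    return null;
  }
  const match = url.pathname.match(/^\/offer\/(\d+)\.html$/);
  return match ? match[1] : null;
}

// Hypothetical helper: build the download directory named in the list above.
function imageDirFor(offerId: string): string {
  return `/tmp/1688-logistics/${offerId}/`;
}
```

Matching against `url.pathname` rather than the raw string means query parameters and fragments on the product link do not affect the extracted ID.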