Go to file
ywkj 92a3e2eba3 fix: capture viewport-only screenshots for readable resolution
captureBeyondViewport was capturing the entire page in one giant image
(2804x20746), making text unreadable. Now captures per-viewport with
80% overlap, producing ~2400x1992 screenshots that vision can read.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-30 12:23:00 +08:00
.forgejo/workflows feat: initial skill template 2026-03-14 10:35:01 +08:00
scripts feat: default CDP port 18800, add 包装信息 and 商品件重尺 extraction 2026-03-30 08:20:46 +08:00
src fix: capture viewport-only screenshots for readable resolution 2026-03-30 12:23:00 +08:00
.gitignore feat: initial skill template 2026-03-14 10:35:01 +08:00
README.md feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00
SKILL.md feat: define structured JSON output schema for API consumption 2026-03-30 12:18:15 +08:00
install.sh fix: restore auth-cli.ts and install.sh for org CI registration 2026-03-30 08:00:56 +08:00
package.json feat: 1688 logistics scraper — extract weight/size from product pages 2026-03-30 07:49:58 +08:00

README.md

1688-logistics-scraper

从 1688 商品页面提取物流相关数据(重量、尺寸、体积)。

通过 Chrome DevTools Protocol (CDP) 连接到已运行的 Chrome 浏览器自动提取商品属性、SKU 变体中的物流数据,并下载详情图片供进一步分析。

前置条件

启动 Chrome 并开启远程调试:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222

安装

bash install.sh

使用

bun scripts/run.ts scrape 'https://detail.1688.com/offer/852504650877.html'

数据来源

  1. 商品属性表(商品属性/商品参数)
  2. SKU/变体规格
  3. 物流信息区域
  4. 商品详情图片(下载到 /tmp/1688-logistics/<offer-id>/