docs(annotate): cleaner architecture diagram layout

Top-down flow (read episodes → 3 modules fan out → validator → writer →
parquet) with aligned boxes, instead of the cramped bordered version.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Pepijn
2026-06-04 11:59:31 +02:00
parent 7bec991cdf
commit 2af2402a0c

View File

@@ -14,21 +14,32 @@ trained on.
## How it fits together
```text
your dataset lerobot-annotate
(LeRobot v3.1) ┌──────────────────────────────────┐
│ │ read episodes
└─────────────▶│ │ │
│ ▼ │
one shared ┌──────┐ ┌─────────────┐ ┌─────┐ │ each module writes
Qwen-VL server ────▶│ │ plan │ │interjections│ │ vqa │ │ raw JSONL into
(vLLM, OpenAI API) └──┬───┘ └──────┬──────┘ └──┬──┘ │ .annotate_staging/
│ └────────────┼───────────┘ │
validator │ checks everything
│ writer ──────────────┼─▶ data/chunk-*/file-*.parquet
└──────────────────────────────────┘ (+ meta/info.json tools)
your dataset lerobot-annotate
(LeRobot v3.1)
┌─────────────────────────────────────────────────────┐
read episodes │
└──────────────────────────┬──────────────────────────┘
────────────────────┼────────────────────┐
┌──────────┐ ┌───────────────┐ ┌──────────┐ one shared Qwen-VL
plan │ │ interjections │ vqa◀── server (vLLM, OpenAI
└────┬─────┘ └───────┬───────┘ └────┬─────┘ API) drives all three
└────────────────────┼─────────────────────┘
│ each module stages raw JSONL
▼ into .annotate_staging/
┌─────────────────┐
│ validator │ ◀── checks everything
└────────┬────────┘
┌─────────────────┐
│ writer │
└────────┬────────┘
data/chunk-*/file-*.parquet
(+ meta/info.json tools)
```
Three modules (`plan`, `interjections`, `vqa`) all talk to **one** shared