docs(annotate): cleaner architecture diagram layout

Top-down flow (read episodes → 3 modules fan out → validator → writer → parquet) with aligned boxes, instead of the cramped bordered version. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-06-04 21:01:26 +00:00 · 2026-06-04 11:59:31 +02:00
parent 7bec991cdf
commit 2af2402a0c
1 changed files with 26 additions and 15 deletions
--- a/docs/source/annotation_pipeline.mdx
+++ b/docs/source/annotation_pipeline.mdx
@@ -14,21 +14,32 @@ trained on.
 ## How it fits together

 ```text
-  your dataset                lerobot-annotate
-  (LeRobot v3.1)        ┌──────────────────────────────────┐
-        │              │   read episodes                    │
-        └─────────────▶│        │                           │
-                       │        ▼                           │
-   one shared          │   ┌──────┐ ┌─────────────┐ ┌─────┐ │  each module writes
-   Qwen-VL server ────▶│   │ plan │ │interjections│ │ vqa │ │  raw JSONL into
-   (vLLM, OpenAI API)  │   └──┬───┘ └──────┬──────┘ └──┬──┘ │  .annotate_staging/
-                       │      └────────────┼───────────┘    │
-                       │                   ▼                 │
-                       │               validator             │  checks everything
-                       │                   │                 │
-                       │                   ▼                 │
-                       │                writer ──────────────┼─▶ data/chunk-*/file-*.parquet
-                       └──────────────────────────────────┘     (+ meta/info.json tools)
+  your dataset                  lerobot-annotate
+  (LeRobot v3.1)
+        │
+        ▼
+  ┌─────────────────────────────────────────────────────┐
+  │                    read episodes                     │
+  └──────────────────────────┬──────────────────────────┘
+                             │
+        ┌────────────────────┼────────────────────┐
+        ▼                    ▼                     ▼
+  ┌──────────┐      ┌───────────────┐        ┌──────────┐       one shared Qwen-VL
+  │   plan   │      │ interjections │        │   vqa    │  ◀──   server (vLLM, OpenAI
+  └────┬─────┘      └───────┬───────┘        └────┬─────┘        API) drives all three
+       └────────────────────┼─────────────────────┘
+                            │   each module stages raw JSONL
+                            ▼   into .annotate_staging/
+                  ┌─────────────────┐
+                  │    validator    │  ◀──  checks everything
+                  └────────┬────────┘
+                           ▼
+                  ┌─────────────────┐
+                  │     writer      │
+                  └────────┬────────┘
+                           ▼
+              data/chunk-*/file-*.parquet
+              (+ meta/info.json tools)
 ```

 Three modules (`plan`, `interjections`, `vqa`) all talk to **one** shared