lerobot-clone/src/lerobot at 7a32f8a72a7f7a56205035ea77d9c0b3f31894ac - lerobot-clone - allenyuan‘s gitea

ydy0615/lerobot-clone

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-04 12:51:27 +00:00

Files

History

Pepijn 7a32f8a72a refactor(recipes): π0.5-style split — action expert conditions on subtask only

Previously ``action_execution`` rendered ``task + plan + memory +
subtask`` into one prefix and ran the flow loss on it. That meant
the action expert was conditioned on the full hierarchical context
(closer to π0.7 §V.A), not just the subtask.

The π0.5 paper's hierarchical inference has the action expert see
only the *subtask* (plus images and state). Split the recipe to
match:

  high_level_subtask  (0.50)
    user(task + plan + memory) → assistant(subtask)
    [+ assistant(new_memory) at boundary frames]
    All ``stream: high_level`` → text-CE only, no flow loss.

  low_level_execution (0.30)
    user(subtask) → assistant(subtask)
    Both ``stream: low_level`` → flow loss fires; text CE on the
    subtask is a small redundant extra signal. Prefix the action
    expert sees: [images, subtask, state].

  plan_generation (0.10) — unchanged.
  ask_vqa_{top,wrist} (0.05 each) — unchanged.

Runtime: the low-level loop in ``smolvla2/inference/steps.py``
now sends ``[user(subtask), assistant(subtask)]`` to
``predict_action_chunk`` instead of the full task+plan+memory
context. Falls back to ``state['task']`` when no subtask has been
generated yet so the first frame still has something to condition
on.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-13 14:13:07 +02:00

..

feat(annotate): compact steerable annotation prompts

2026-05-04 15:57:04 +02:00

async_inference

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

refactor(recipes): π0.5-style split — action expert conditions on subtask only

2026-05-13 14:13:07 +02:00

data_processing

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

fix(datasets,annotate): tag pushed dataset + clean revision error

2026-05-05 18:23:18 +02:00

feat(sim): VLABench benchmark integration (#3396 )

2026-04-21 17:54:11 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

refactor(recipes): π0.5-style split — action expert conditions on subtask only

2026-05-13 14:13:07 +02:00

fix(smolvla2): train on rendered language batches

2026-05-05 08:55:56 +00:00

fix(rl): swap dict merge order to preserve teleop intervention flag (#3273 )

2026-04-14 16:20:54 +02:00

feat(smolvla2-runtime): overfit/memorisation diagnostics on the panel

2026-05-12 17:31:04 +02:00

feat(pi052): auto-fit FAST tokenizer per-dataset before training

2026-05-13 11:52:31 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

chore(policies): deprecate pi0fast (#2203 )

2025-10-14 16:00:42 +02:00

feat(tools): src/lerobot/tools/ — runnable tool registry + SayTool

2026-04-30 18:58:04 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

fix(smolvla2): train on rendered language batches

2026-05-05 08:55:56 +00:00

__init__.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

__version__.py

Package folder structure (#1417 )

2025-07-01 16:34:46 +02:00

types.py

chore(dependecies): untangle dependecies across internal modules (#3149 )

2026-03-15 20:26:06 -07:00