feat(smolvla2-runtime): overfit/memorisation diagnostics on the panel

The autonomous-mode panel now surfaces what the model is *actually* producing at every chunk boundary, not just what got accepted: * last_subtask_raw most recent generation (accepted or not) * subtask_repeat_count times the same accepted string regenerated * subtask_gibberish_count rejections by the gibberish filter * memory_gibberish_count / plan_gibberish_count for the other heads These let the operator see memorisation collapse without scrolling back through logs: subtask diag repeat:8 gibberish:0 last_raw: '<same string>' ^^^^^^^^^^ → model can't move past current phase subtask diag repeat:0 gibberish:14 last_raw: 'Ass:::' ^^^^^^^^^^^^^^^^^^^^^^ → LM collapsed to template salad Also silences the per-action ``Relative goal position magnitude had to be clamped`` warning. The clamp fires every dispatch tick when the model emits stale joint targets, flooding the panel at ctrl_hz=30. Replaced the bare ``logging.warning`` call in robots/utils.py with a module logger so it can be selectively raised to ERROR. Operators who need the per-tick clamp detail can use ``-v``. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-06-03 04:11:24 +00:00 · 2026-05-12 17:31:04 +02:00
parent c98c695127
commit aecb80a9d2
3 changed files with 74 additions and 3 deletions
--- a/src/lerobot/scripts/lerobot_smolvla2_runtime.py
+++ b/src/lerobot/scripts/lerobot_smolvla2_runtime.py
@@ -883,6 +883,37 @@ def _make_state_panel_renderer(
            f"dispatched: {dispatched}    "
            f"pending tool calls: {pending}[/]"
        )
+
+        # Overfit / memorisation diagnostics. The high-level steps
+        # surface the raw generation each time they fire (even when
+        # rejected as gibberish or unchanged), plus repeat/rejection
+        # counters. Rule of thumb:
+        #
+        #   * subtask repeat ≥ ~5 and queue_len cycles fully → model
+        #     can't move past current subtask (memorised one phase
+        #     of the task — classic overfit signature)
+        #   * subtask gibberish climbing → LM head collapsed to
+        #     chat-template fragments / one-token salads
+        #   * last raw differs from accepted → at least the LM is
+        #     varying, the gibberish filter is doing its job
+        raw_subtask = st.get("last_subtask_raw")
+        sub_rep = int(st.get("subtask_repeat_count") or 0)
+        sub_gib = int(st.get("subtask_gibberish_count") or 0)
+        if raw_subtask is not None or sub_rep or sub_gib:
+            raw_display = (raw_subtask or "(empty)")[:80]
+            color = "yellow" if (sub_rep >= 3 or sub_gib >= 3) else "dim"
+            console.print(
+                f"  [{color}]subtask diag    repeat:{sub_rep}  "
+                f"gibberish:{sub_gib}  last_raw: {raw_display!r}[/]"
+            )
+
+        # Same diagnostics for memory and plan when available.
+        mem_gib = int(st.get("memory_gibberish_count") or 0)
+        plan_gib = int(st.get("plan_gibberish_count") or 0)
+        if mem_gib or plan_gib:
+            console.print(
+                f"  [dim]gen rejects     memory:{mem_gib}  plan:{plan_gib}[/]"
+            )
        console.rule(style="cyan")
        if robot_lines:
            for line in robot_lines:
@@ -939,6 +970,16 @@ def _silence_noisy_loggers() -> None:
    ):
        logging.getLogger(name).setLevel(logging.WARNING)

+    # The robot's relative-goal-position clamp warning fires *every*
+    # dispatch tick on a memorised model — the LM is trying to jump
+    # the wrist far past where max_relative_target allows, so the
+    # warning floods the panel at ~30 Hz. Promote it from WARNING to
+    # DEBUG: the dispatch counter on the panel already tells the
+    # operator the loop is running, and the panel itself shows
+    # whether motion is happening. If anyone needs the per-action
+    # clamp detail, ``-v`` puts it back via DEBUG.
+    logging.getLogger("lerobot.robots.utils").setLevel(logging.ERROR)
+

 def main(argv: list[str] | None = None) -> int:
    args = _parse_args(argv)