diff --git a/src/lerobot/annotations/steerable_pipeline/prompts/module_1_subtasks.txt b/src/lerobot/annotations/steerable_pipeline/prompts/module_1_subtasks.txt
index e1c8f822e..e6a5260a7 100644
--- a/src/lerobot/annotations/steerable_pipeline/prompts/module_1_subtasks.txt
+++ b/src/lerobot/annotations/steerable_pipeline/prompts/module_1_subtasks.txt
@@ -85,6 +85,23 @@ Authoring rules — Hi Robot atom granularity, pi0.7-style short prompts:
 - Every subtask's [start_time, end_time] must lie within [0.0, {episode_duration}] seconds.
 
+SPECIAL CASES — verb disambiguation (each rule is narrowly visual and
+fires ONLY on the spatial situation it names; it must not change how you
+label any other situation):
+- STACK vs PUT: if an object is placed ON TOP OF another specific object
+  (not on a flat table / shelf / counter), use "stack ... on ...", not
+  "put". "stack blue book on green book", NOT "put blue book on table".
+- INSERT vs PUT: if an object goes INTO a fitted slot / hole / socket /
+  receptacle (push-fit), use "insert ... into ...", not "put".
+- RETRIEVE/PICK-UP vs PUT (direction): watch the gripper. If it CLOSES
+  on the object and the object moves WITH the hand, it is "pick up" /
+  "retrieve" (object leaves its location). If the gripper OPENS and the
+  object stays where the hand left it, it is "put" / "place" (object
+  arrives at a location). Decide by which way the object moves, not by
+  where the hand ends up.
+- POUR vs PUT: only use "pour" when the source is tilted and contents
+  flow out; moving a full container without tilting is "put"/"place".
+
 Output strictly valid JSON of shape:
 {{