Refactor`gym_manipulator.py` using the universal pipeline (#1650)

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-02 20:01:25 +00:00

* Migrate gym_manipulator to use the pipeline
Added get_teleop_events function to capture relevant events from teleop devices unrelated to actions

* Added the capability to record a dataset

* Added the replay functionality with the pipeline

* Refactored `actor.py` to use the pipeline

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* RL works at this commit - fixed actor.py and bugs in gym_manipulator

* change folder structure to reduce the size of gym_manip

* Refactored hilserl config

* Remove dataset and mode from HilSerlEnvConfig to a GymManipulatorConfig to reduce verbose of configs during training

* format docs

* removed get_teleop_events from abc

* Refactor environment configuration and processing pipeline for GymHIL support. Removed device attribute from HILSerlRobotEnvConfig, added DummyTeleopDevice for simulation, and updated processor creation to accommodate GymHIL environments.

* Improved typing for HILRobotEnv config and GymManipulator config

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Migrated `gym_manipulator` to use a more modular structure similar to phone teleop

* Refactor gripper handling and transition processing in HIL and robot kinematic processors

- Updated gripper position handling to use a consistent key format across processors
- Improved the EEReferenceAndDelta class to handle reference joint positions.
- Added support for discrete gripper actions in the GripperVelocityToJoint processor.
- Refactored the gym manipulator to improve modularity and clarity in processing steps.

* Added delta_action_processor mapping wrapper

* Added missing file delta_action_processor and improved imports in `gym_manipulator`

* nit

* Added missing file joint_observation_processor

* Enhance processing architecture with new teleoperation processors

- Introduced `AddTeleopActionAsComplimentaryData` and `AddTeleopEventsAsInfo` for integrating teleoperator actions and events into transitions.
- Added `Torch2NumpyActionProcessor` and `Numpy2TorchActionProcessor` for seamless conversion between PyTorch tensors and NumPy arrays.
- Updated `__init__.py` to include new processors in module exports, improving modularity and clarity in the processing pipeline.
- GymHIL is now fully supported with HIL using the pipeline

* Refactor configuration structure for gym_hil integration

- Renamed sections for better readability, such as changing "Gym Wrappers Configuration" to "Processor Configuration."
- Enhanced documentation with clear examples for dataset collection and policy evaluation configurations.

* Enhance reset configuration and teleoperation event handling

- Added `terminate_on_success` parameter to `ResetConfig` and `InterventionActionProcessor` for controlling episode termination behavior upon success detection.
- Updated documentation to clarify the impact of `terminate_on_success` on data collection for reward classifier training.
- Refactored teleoperation event handling to use `TeleopEvents` constants for improved readability and maintainability across various modules.

* fix(keyboard teleop), delta action keys

* Added transform features and feature contract

* Added transform features for image crop

* Enum for TeleopEvents

* Update tranform_features delta action proc

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This commit is contained in:

Michel Aractingi

2025-08-11 11:07:55 +02:00

committed by

GitHub

parent fd5d8b3d5f

commit 0053defa2e

17 changed files with 1975 additions and 2251 deletions

									
										3

src/lerobot/scripts/rl/learner.py
									
												View File
												
				@@ -75,6 +75,7 @@ from lerobot.policies.sac.modeling_sac import SACPolicy

				from lerobot.robots import so100_follower  # noqa: F401

				from lerobot.scripts.rl import learner_service

				from lerobot.teleoperators import gamepad, so101_leader  # noqa: F401

				from lerobot.teleoperators.utils import TeleopEvents

				from lerobot.transport import services_pb2_grpc

				from lerobot.transport.utils import (

				    MAX_MESSAGE_SIZE,

				@@ -1174,7 +1175,7 @@ def process_transitions(

				            # Add to offline buffer if it's an intervention

				            if dataset_repo_id is not None and transition.get("complementary_info", {}).get(

				                "is_intervention"

				                TeleopEvents.IS_INTERVENTION

				            ):

				                offline_replay_buffer.add(**transition)

Refactorgym_manipulator.py using the universal pipeline (#1650)

3 src/lerobot/scripts/rl/learner.py Unescape Escape View File

Refactor`gym_manipulator.py` using the universal pipeline (#1650)

3

src/lerobot/scripts/rl/learner.py

View File