lerobot-clone

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-02 11:51:25 +00:00

Author	SHA1	Message	Date
Adil Zouitine	2051dd38fc	[HIL-SERL] Review feedback modifications (#1112 )	2025-05-15 15:24:41 +02:00
Michel Aractingi	69ece1407b	Improved the takeover logic in the case of `leader_automatic` control_mode in `gym_manipulator.py`	2025-05-12 17:47:13 +02:00
Michel Aractingi	b104f8b012	Added number of steps after success as parameter in config	2025-05-09 18:09:10 +02:00
Michel Aractingi	fb9bb89cb4	Fixes in record_dataset and import gym_hil	2025-05-09 12:00:21 +02:00
Michel Aractingi	bdd9229576	robot_type nit	2025-05-07 13:59:21 +02:00
Michel Aractingi	633edcb3af	added names in `record_dataset` function of gym_manipulator	2025-05-07 13:58:24 +02:00
Michel Aractingi	6792c3de8f	Added missing lisences	2025-05-07 10:06:59 +02:00
Adil Zouitine	ad132c9c39	[HIL SERL] Env management and add gym-hil (#1077 ) Co-authored-by: Michel Aractingi <michel.aractingi@gmail.com>	2025-05-07 09:39:21 +02:00
Michel Aractingi	5998203a33	[Port HIL-SERL] Final fixes for reward classifier (#1067 ) Co-authored-by: s1lent4gnt <kmeftah.khalil@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-05-05 11:33:09 +02:00
AdilZouitine	4257fe5045	rename reward classifier	2025-04-25 18:38:52 +02:00
Michel Aractingi	bd4db8d747	[Port HIl-Serl] Refactor gym-manipulator (#1034 )	2025-04-25 16:34:54 +02:00
AdilZouitine	c5845ee203	Fix linter issue	2025-04-22 10:37:08 +02:00
AdilZouitine	a7a51cfc9c	Refactor SACPolicy and configuration to replace 'grasp_critic' terminology with 'discrete_critic'. Update related methods and comments for clarity and consistency in handling discrete actions.	2025-04-18 14:57:03 +00:00
pre-commit-ci[bot]	0d70f0b85c	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 14:22:11 +00:00
Michel Aractingi	9886520d33	Added option to add current readings to the state of the policy	2025-04-18 16:18:13 +02:00
Michel Aractingi	3b24ad3c84	Fixes for the reward classifier	2025-04-18 16:18:13 +02:00
pre-commit-ci[bot]	fb92935601	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 13:33:37 +00:00
pre-commit-ci[bot]	28b595c651	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:10:22 +02:00
Michel Aractingi	9fd4c21d4d	General fixes in code, removed delta action, fixed grasp penalty, added logic to put gripper reward in info	2025-04-18 15:10:22 +02:00
Michel Aractingi	0cce2fe0fa	Added Gripper quantization wrapper and grasp penalty removed complementary info from buffer and learner server removed get_gripper_action function added gripper parameters to `common/envs/configs.py`	2025-04-18 15:10:22 +02:00
pre-commit-ci[bot]	88d26ae976	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:10:22 +02:00
s1lent4gnt	ff18be18ad	Add gripper penalty wrapper	2025-04-18 15:10:22 +02:00
Michel Aractingi	8eb3c1510c	Added support for controlling the gripper with the pygame interface of gamepad Minor modifications in gym_manipulator to quantize the gripper actions clamped the observations after F.resize in ConvertToLeRobotObservation wrapper due to a bug in F.resize, images were returned exceeding the maximum value of 1.0	2025-04-18 15:10:22 +02:00
Michel Aractingi	05a237ce10	Added gripper control mechanism to gym_manipulator Moved HilSerl env config to configs/env/configs.py fixes in actor_server and modeling_sac and configuration_sac added the possibility of ignoring missing keys in env_cfg in get_features_from_env_config function	2025-04-18 15:10:22 +02:00
AdilZouitine	88cc2b8fc8	Add WrapperConfig for environment wrappers and update SACConfig properties - Introduced `WrapperConfig` dataclass for environment wrapper configurations. - Updated `ManiskillEnvConfig` to include a `wrapper` field for enhanced environment management. - Modified `SACConfig` to return `None` for `observation_delta_indices` and `action_delta_indices` properties. - Refactored `make_robot_env` function to improve readability and maintainability.	2025-04-18 15:10:22 +02:00
Michel Aractingi	b69132c79d	Change HILSerlRobotEnvConfig to inherit from EnvConfig Added support for hil_serl classifier to be trained with train.py run classifier training by python lerobot/scripts/train.py --policy.type=hilserl_classifier fixes in find_joint_limits, control_robot, end_effector_control_utils	2025-04-18 15:10:21 +02:00
AdilZouitine	db897a1619	[WIP] Update SAC configuration and environment settings - Reduced frame rate in `ManiskillEnvConfig` from 400 to 200. - Enhanced `SACConfig` with new dataclasses for actor, learner, and network configurations. - Improved input and output feature management in `SACConfig`. - Refactored `actor_server` and `learner_server` to access configuration properties directly. - Updated training pipeline to validate configurations and handle dataset repo IDs more robustly.	2025-04-18 15:09:46 +02:00
AdilZouitine	056f79d358	[WIP] Non functional yet Add ManiSkill environment configuration and wrappers - Introduced `VideoRecordConfig` for video recording settings. - Added `ManiskillEnvConfig` to encapsulate environment-specific configurations. - Implemented various wrappers for the ManiSkill environment, including observation and action scaling. - Enhanced the `make_maniskill` function to create a wrapped ManiSkill environment with video recording and observation processing. - Updated the `actor_server` and `learner_server` to utilize the new configuration structure. - Refactored the training pipeline to accommodate the new environment and policy configurations.	2025-04-18 15:09:46 +02:00
Michel Aractingi	114ec644d0	Change config logic in: - gym_manipulator - find_joint_limits - end_effector_utils	2025-04-18 15:09:45 +02:00
pre-commit-ci[bot]	0ea27704f6	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:09:25 +02:00
pre-commit-ci[bot]	1c8daf11fd	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:07:46 +02:00
Michel Aractingi	7b01e16439	Add end effector action space to hil-serl (#861 ) Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-04-18 15:06:52 +02:00
Eugene Mironov	e1d55c7a44	[Port HIL-SERL] Adjust Actor-Learner architecture & clean up dependency management for HIL-SERL (#722 )	2025-04-18 15:04:56 +02:00
Michel Aractingi	0d88a5ee09	- Fixed big issue in the loading of the policy parameters sent by the learner to the actor -- pass only the actor to the `update_policy_parameters` and remove `strict=False` - Fixed big issue in the normalization of the actions in the `forward` function of the critic -- remove the `torch.no_grad` decorator in `normalize.py` in the normalization function - Fixed performance issue to boost the optimization frequency by setting the storage device to be the same as the device of learning. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:44 +02:00
AdilZouitine	a90f4872f2	Add maniskill support. Co-authored-by: Michel Aractingi <michel.aractingi@gmail.com>	2025-04-18 15:04:44 +02:00
Michel Aractingi	a16ea283f5	Fixed bug in the action scale of the intervention actions and offline dataset actions. (scale by inverse delta) Co-authored-by: Adil Zouitine <adizouitinegm@gmail.com>	2025-04-18 15:04:44 +02:00
Michel Aractingi	140e30e386	Changed the init_final value to center the starting mean and std of the policy Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:43 +02:00
Michel Aractingi	5195f40fd3	Hardcoded some normalization parameters. TODO refactor Added masking actions on the level of the intervention actions and offline dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:43 +02:00
Michel Aractingi	ee820859d3	Added logging for interventions to monitor the rate of interventions through time Added an s keyboard command to force success in the case the reward classifier fails Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:43 +02:00
Michel Aractingi	5d6879d93a	Added possiblity to record and replay delta actions during teleoperation rather than absolute actions Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:42 +02:00
Michel Aractingi	f1af97dc9c	- Added JointMaskingActionSpace wrapper in `gym_manipulator` in order to select which joints will be controlled. For example, we can disable the gripper actions for some tasks. - Added Nan detection mechanisms in the actor, learner and gym_manipulator for the case where we encounter nans in the loop. - changed the non-blocking in the `.to(device)` functions to only work for the case of cuda because they were causing nans when running the policy on mps - Added some joint clipping and limits in the env, robot and policy configs. TODO clean this part and make the limits in one config file only. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00
Michel Aractingi	9784d8a47f	Several fixes to move the actor_server and learner_server code from the maniskill environment to the real robot environment. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00
Michel Aractingi	12c13e320e	- Added `lerobot/scripts/server/gym_manipulator.py` that contains all the necessary wrappers to run a gym-style env around the real robot. - Added `lerobot/scripts/server/find_joint_limits.py` to test the min and max angles of the motion you wish the robot to explore during RL training. - Added logic in `manipulator.py` to limit the maximum possible joint angles to allow motion within a predefined joint position range. The limits are specified in the yaml config for each robot. Checkout the so100.yaml. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00

43 Commits