lerobot-clone

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-02 11:51:25 +00:00

Author	SHA1	Message	Date
pre-commit-ci[bot]	38f5fa4523	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
Michel Aractingi	b29401e4e2	- Refactor observation encoder in `modeling_sac.py` - added `torch.compile` to the actor and learner servers. - organized imports in `train_sac.py` - optimized the parameters push by not sending the frozen pre-trained encoder. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	ef64ba91d9	Added server directory in `lerobot/scripts` that contains scripts and the protobuf message types to split training into two processes, acting and learning. The actor rollouts the policy and collects interaction data while the learner recieves the data, trains the policy and sends the updated parameters to the actor. The two scripts are ran simultaneously Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
AdilZouitine	83dc00683c	Stable version of rlpd + drq	2025-03-28 17:18:24 +00:00
Adil Zouitine	4b78ab2789	Change SAC policy implementation with configuration and modeling classes	2025-03-28 17:18:24 +00:00
Adil Zouitine	bd8c768f62	SAC works	2025-03-28 17:18:24 +00:00
Adil Zouitine	921ed960fb	Add rlpd tricks	2025-03-28 17:18:24 +00:00
Adil Zouitine	67b64e445b	SAC works	2025-03-28 17:18:24 +00:00
Adil Zouitine	b495b19a6a	[WIP] correct sac implementation	2025-03-28 17:18:24 +00:00