Michel Aractingi
|
508f5d1407
|
Added server directory in lerobot/scripts that contains scripts and the protobuf message types to split training into two processes, acting and learning. The actor rollouts the policy and collects interaction data while the learner recieves the data, trains the policy and sends the updated parameters to the actor. The two scripts are ran simultaneously
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
|
2025-04-18 15:04:13 +02:00 |
|
AdilZouitine
|
c8b1132846
|
Stable version of rlpd + drq
|
2025-04-18 15:04:10 +02:00 |
|
Adil Zouitine
|
760d60ad4b
|
Change SAC policy implementation with configuration and modeling classes
|
2025-04-18 15:03:51 +02:00 |
|
Adil Zouitine
|
875c0271b7
|
SAC works
|
2025-04-18 15:03:51 +02:00 |
|
Adil Zouitine
|
46827fb002
|
Add rlpd tricks
|
2025-04-18 15:03:51 +02:00 |
|
Adil Zouitine
|
2fd78879f6
|
SAC works
|
2025-04-18 15:03:51 +02:00 |
|
Adil Zouitine
|
a0e2be8b92
|
[WIP] correct sac implementation
|
2025-04-18 15:03:51 +02:00 |
|