lucidrains / SAC-pytorch
Implementation of Soft Actor-Critic and some of its improvements in PyTorch
☆51 · Updated 3 weeks ago
Alternatives and similar repositories for SAC-pytorch:
Users who are interested in SAC-pytorch are comparing it to the repositories listed below.
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL. ☆33 · Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 · Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆108 · Updated 4 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆57 · Updated 7 months ago
- ☆71 · Updated 2 months ago
- ☆175 · Updated last month
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data ☆56 · Updated 5 months ago
- ☆69 · Updated 3 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024 ☆61 · Updated 9 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆148 · Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆89 · Updated 3 months ago
- Efficient baselines for autocurricula in JAX. ☆175 · Updated 4 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆76 · Updated 9 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆133 · Updated last month
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆94 · Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration." ☆26 · Updated 2 months ago
- ☆66 · Updated 4 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization ☆64 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆54 · Updated 9 months ago
- PyTorch Package For Quasimetric Learning ☆41 · Updated 2 months ago
- Implementation of BC-IRL and other IRL baselines ☆25 · Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆19 · Updated last month
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023 ☆45 · Updated 8 months ago
- Baselines for gymnax 🤖 ☆61 · Updated last year
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo… ☆100 · Updated 6 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆67 · Updated 5 months ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference" ☆39 · Updated 6 months ago
- Learn online intrinsic rewards from LLM feedback ☆33 · Updated last month
- ☆42 · Updated 6 months ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆61 · Updated last year