lucidrains / SAC-pytorch
Implementation of Soft Actor Critic and some of its improvements in Pytorch
☆56Updated 3 months ago
Alternatives and similar repositories for SAC-pytorch
Users that are interested in SAC-pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆117Updated last week
- An implementation of PPO in Pytorch☆79Updated this week
- ☆78Updated 6 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆45Updated last week
- Foundation Policies with Hilbert Representations (ICML 2024)☆85Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆56Updated 7 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆66Updated 11 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆69Updated 3 months ago
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo…☆107Updated 10 months ago
- ☆82Updated 2 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆161Updated last month
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- High quality implementations of imitation and inverse reinforcement learning algorithms☆15Updated last month
- The official implementation of flow Q-learning (FQL)☆145Updated 2 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆67Updated 2 weeks ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆98Updated 7 months ago
- Selected list of papers on World Models that I found interesting and/or useful.☆21Updated 3 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆154Updated this week
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆33Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 6 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆152Updated 2 years ago
- On-Policy Policy Gradient Algorithms in JAX☆34Updated last year
- ☆44Updated last month
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆70Updated last year