aluscher / torchbeastpopart
Deep Learning Project
☆20 Updated 4 years ago
Related projects
Alternatives and complementary repositories for torchbeastpopart
- ☆54 Updated 8 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Advantage weighted Actor Critic for Offline RL ☆47 Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- ☆52 Updated last year
- ☆38 Updated last year
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆26 Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆144 Updated 3 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆72 Updated 2 years ago
- Simple maze environments using mujoco-py ☆52 Updated 10 months ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆26 Updated 2 years ago
- A set of competitive environments for Reinforcement Learning research. ☆28 Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆48 Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆62 Updated 4 months ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN ☆34 Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- ☆18 Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆105 Updated 2 years ago
- ☆47 Updated last year
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/ ☆90 Updated 3 years ago
- ☆110 Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆33 Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated last year
- V-MPO torch version with DMLab30 and GTrXL ☆12 Updated 3 years ago
- ☆28 Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆25 Updated last year