aluscher / torchbeastpopart
Deep Learning Project
☆22Updated 5 years ago
Alternatives and similar repositories for torchbeastpopart:
Users that are interested in torchbeastpopart are comparing it to the libraries listed below
- ☆42Updated 2 years ago
- ☆53Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆45Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- Advantage weighted Actor Critic for Offline RL☆50Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 10 months ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆18Updated 4 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆27Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- ☆14Updated 3 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆70Updated 11 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- ☆55Updated 2 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 5 years ago
- ☆17Updated 3 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆52Updated 3 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 6 months ago
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- ☆31Updated 4 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆75Updated 2 years ago
- ☆53Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆23Updated last year
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 3 years ago
- ☆48Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆26Updated 3 years ago