qlan3 / MeDQN
The official implementation of Memory-efficient DQN algorithm.
☆10Updated last year
Alternatives and similar repositories for MeDQN:
Users that are interested in MeDQN are comparing it to the libraries listed below
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 11 months ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆16Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆14Updated 2 years ago
- ☆23Updated 11 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆19Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆13Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆21Updated 11 months ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆15Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- ☆35Updated 2 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 3 weeks ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13Updated 10 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆12Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆18Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- ☆24Updated 7 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- ☆9Updated 4 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated 11 months ago
- ☆22Updated 2 years ago