felix-kerkhoff / DQfDLinks
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆32Updated 3 years ago
Alternatives and similar repositories for DQfD
Users that are interested in DQfD are comparing it to the libraries listed below
Sorting:
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 8 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆107Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆90Updated 5 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 3 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆146Updated 6 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 6 months ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆141Updated last year
- Transformer in RL for decision-making☆103Updated 2 years ago
- MADDPG in Ray/RLlib☆54Updated 5 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆61Updated 5 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆181Updated 3 years ago
- ☆40Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆134Updated 9 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆88Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 4 years ago
- ☆40Updated 4 years ago
- A collection of offline reinforcement learning algorithms.☆207Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆221Updated last year
- ☆49Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 3 years ago