AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14Updated 4 years ago
Alternatives and similar repositories for MyDiscor
Users that are interested in MyDiscor are comparing it to the libraries listed below
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆62Updated 2 years ago
- Meta RL codebase for Unstable Baselines☆22Updated 3 years ago
- ☆49Updated 4 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Updated 5 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Updated 2 years ago
- The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".☆18Updated 3 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆60Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- ☆51Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆146Updated last year
- Distributional Soft Actor Critic☆59Updated 5 years ago
- A collection of offline reinforcement learning algorithms.☆207Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Updated last year
- ☆30Updated 3 years ago
- Robust and safe deep reinforcement learning algorithms☆16Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆182Updated 3 years ago
- ☆39Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆88Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆67Updated 4 years ago
- Model-based Offline Policy Optimization, re-implemented entirely in PyTorch☆37Updated 2 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆15Updated 2 years ago
- Assignments for CS294-112.☆30Updated 6 years ago
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆90Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆136Updated 10 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆81Updated 2 years ago
- ☆58Updated 2 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆20Updated 3 years ago