lns / memoire
☆18Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for memoire
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆29Updated 2 years ago
- ☆59Updated 6 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- ☆26Updated 5 years ago
- ☆29Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated last year
- RLA is a tool for managing your RL experiments automatically☆70Updated last year
- Code for Sibling Rivalry and experiments presented in associated paper☆16Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 4 years ago
- FEN Code☆37Updated 5 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- ☆85Updated 3 months ago
- ☆40Updated 3 years ago
- PyTorch IMPALA implementation☆24Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆47Updated 3 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Updated last year
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- ☆97Updated 3 years ago
- ☆43Updated last year
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- ☆33Updated 2 months ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago