tencent-ailab / ArenaLinks
☆10Updated 4 years ago
Alternatives and similar repositories for Arena
Users that are interested in Arena are comparing it to the libraries listed below
Sorting:
- Reinforcement Learning papers on exploration methods.☆19Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 10 months ago
- Ranking Policy Gradient☆23Updated 5 years ago
- ☆17Updated 4 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 5 years ago
- Code for Continual Learning of Control Primitives☆18Updated 4 years ago
- ☆17Updated 3 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆16Updated 4 years ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 4 years ago
- ☆19Updated 3 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Updated 3 years ago
- Gym wrapper for Vizdoom environments☆12Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- ☆14Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆17Updated 3 years ago
- ICRL 2020☆19Updated 5 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 6 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- ☆16Updated 4 years ago
- ☆21Updated 6 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago