aravinho / hexitLinks
Agent to play the game Hex, based on the Expert Iteration from the paper Thinking Fast and Slow with Deep Learning and Tree Search (NIPS 2017)
☆7Updated 6 years ago
Alternatives and similar repositories for hexit
Users that are interested in hexit are comparing it to the libraries listed below
Sorting:
- Neurosymbolic transformers for multi-agent communication.☆22Updated 4 years ago
- Soft Actor-Critic☆147Updated 7 years ago
- ☆41Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆37Updated 5 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆76Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- ☆61Updated 6 years ago
- An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel☆73Updated 7 years ago
- PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper☆26Updated 3 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆78Updated 5 years ago
- Reinforcement Learning papers on exploration methods.☆19Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆19Updated 5 years ago
- ☆43Updated 6 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Updated 6 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated last month
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆50Updated 6 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆119Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 6 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆73Updated 2 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆92Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- RL framework for embodied agents based on PyTorch☆12Updated 6 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Mind-aware Multi-agent Management Reinforcement Learning☆82Updated 6 years ago