aravinho / hexitLinks

Agent to play the game Hex, based on the Expert Iteration from the paper Thinking Fast and Slow with Deep Learning and Tree Search (NIPS 2017)

☆7

Alternatives and similar repositories for hexit

Users that are interested in hexit are comparing it to the libraries listed below

Sorting:

jinala / multi-agent-neurosym-transformers
Neurosymbolic transformers for multi-agent communication.
☆22Updated 4 years ago
ben-eysenbach / sac
Soft Actor-Critic
☆147Updated 7 years ago
tianjunz / NovelD
☆41Updated 3 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆37Updated 5 years ago
ZhengyaoJiang / NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
☆76Updated 5 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39Updated 7 years ago
Hwhitetooth / lirpg
☆61Updated 6 years ago
bkgoksel / emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
☆73Updated 7 years ago
bhiziroglu / Language-as-an-Abstraction-for-Hierarchical-Deep-Reinforcement-Learning
PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper
☆26Updated 3 years ago
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆78Updated 5 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆19Updated 3 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆28Updated 5 years ago
tgangwani / RL-Indirect-imitation
Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
☆19Updated 5 years ago
russellmendonca / maesn_suite
☆43Updated 6 years ago
arushijain94 / SafeOptionCritic
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆20Updated 6 years ago
salesforce / sibling-rivalry
Code for Sibling Rivalry and experiments presented in associated paper
☆17Updated last month
YuhangSong / DEHRL
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
☆50Updated 6 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆119Updated 4 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
rcheng805 / CORE-RL
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…
☆32Updated 4 years ago
roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55Updated 5 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
ermongroup / MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
☆73Updated 2 years ago
ezliu / dream
Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning
☆92Updated 2 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆131Updated last year
Knoxantropicen / model-based-meta-rl
Self-implemented code for Model-Based Meta-Reinforcement Learning
☆17Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
idlrl / flare
RL framework for embodied agents based on PyTorch
☆12Updated 6 years ago
Stanford-ILIAD / ELLA
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21Updated 4 years ago
facebookresearch / M3RL
Mind-aware Multi-agent Management Reinforcement Learning
☆82Updated 6 years ago