Ktakuya332C / deepcube
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆12Updated 6 years ago
Alternatives and similar repositories for deepcube:
Users that are interested in deepcube are comparing it to the libraries listed below
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆98Updated 5 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13Updated 5 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆35Updated 3 years ago
- Using self-play, MCTS, and a deep neural network to create a hearthstone ai player☆29Updated 6 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆74Updated 5 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Updated 2 years ago
- Framework for inspecting actions and observations in StarCraft II replays☆20Updated 6 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆76Updated last month
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆30Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆44Updated 2 years ago
- Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy.☆18Updated 6 years ago
- ☆11Updated 2 years ago
- Deep reinforcement learning of mahjong self-play☆17Updated 6 years ago
- Learning-based agent for Google Research Football (football game agent)☆111Updated last year
- ☆12Updated 3 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated last year
- ☆19Updated 3 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆12Updated 4 years ago
- This is the source code of Agar.io environment.☆23Updated 3 years ago
- ☆15Updated 11 months ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch☆51Updated 2 weeks ago
- Implementation of Deepmind's AlphaZero algorithm with Caffe and C++☆19Updated 6 years ago
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.☆43Updated 4 years ago
- ☆39Updated last year
- FQF(Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At…☆41Updated 4 years ago
- ☆18Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) (football game agent)☆51Updated last year