YuriCat/MuZeroJupyterExample

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YuriCat/MuZeroJupyterExample)

YuriCat / MuZeroJupyterExample

☆66

Alternatives and similar repositories for MuZeroJupyterExample

Users that are interested in MuZeroJupyterExample are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wulfebw / muzero
View on GitHub
A python implemenation of tabular MuZero for educational purposes
☆21Dec 11, 2019Updated 6 years ago
johan-gras / MuZero
View on GitHub
A structured implementation of MuZero
☆205Jun 4, 2022Updated 4 years ago
Zeta36 / muzero
View on GitHub
A simple implementation of MuZero algorithm for connect4 game
☆96Aug 11, 2020Updated 5 years ago
koulanurag / muzero-pytorch
View on GitHub
Pytorch Implementation of MuZero
☆356Jul 23, 2023Updated 3 years ago
jvarsoke / ictk
View on GitHub
Internet Chess ToolKit is a java based set of libraries and widgets useful for performing common tasks such as reading PGN, FEN, and gene…
☆12Feb 22, 2017Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tambetm / gymexperiments
View on GitHub
☆28Apr 28, 2019Updated 7 years ago
Zeta36 / Asynchronous-Methods-for-Deep-Reinforcement-Learning
View on GitHub
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …
☆84Mar 4, 2016Updated 10 years ago
tuero / muzero-cpp
View on GitHub
A C++ pytorch implementation of MuZero
☆40May 18, 2026Updated 2 months ago
only-changer / GeneraLight
View on GitHub
☆12Aug 15, 2020Updated 5 years ago
calclavia / relay-generator
View on GitHub
The architecture used to train the level generator in the game Relay.
☆12Apr 8, 2017Updated 9 years ago
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
facebookresearch / rela
View on GitHub
Reinforcement Learning Assembly
☆94Sep 2, 2021Updated 4 years ago
asonabend / ESRL
View on GitHub
Code for Expert Supervised Reinforcement Learning
☆10Apr 7, 2021Updated 5 years ago
rarilurelo / BinaryNetConvolution
View on GitHub
Implement BinaryNet of CNN with chainer
☆11May 5, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Grzego / async-rl
View on GitHub
Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea…
☆44Feb 27, 2018Updated 8 years ago
DTaoo / DMC
View on GitHub
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15May 27, 2020Updated 6 years ago
zameyer1 / Evolutionary-Trading-Strategies
View on GitHub
This code illustrates the use of genetic programming to evolve financial trading strategies for a single equity stock. Individuals (strat…
☆25Feb 24, 2019Updated 7 years ago
BlueScottie / SUMO-Carla-integration
View on GitHub
☆12Mar 29, 2023Updated 3 years ago
YushuoLi / Gato-A-Generalist-Agent
View on GitHub
Minimal code for A Generalist Agent
☆44Nov 4, 2022Updated 3 years ago
5vision / uct_atari
View on GitHub
uct tree search + supervised lerning for atari games
☆12Feb 14, 2017Updated 9 years ago
jaimeyzzz / impala_horovod_gym
View on GitHub
☆10Sep 20, 2018Updated 7 years ago
wenhuchen / GPT2-Logic2Text
View on GitHub
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18Jun 1, 2020Updated 6 years ago
fabienbaradel / cophy
View on GitHub
"CoPhy: Counterfactual Learning of Physical Dynamics", F. Baradel, N. Neverova, J. Mille, G. Mori, C. Wolf, ICLR'2020
☆36Apr 28, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
werner-duvaud / muzero-general
View on GitHub
MuZero
☆2,843Sep 3, 2024Updated last year
UesugiErii / tf2-PPO-atari
View on GitHub
Use tensorflow2 achieve PPO to play atari game
☆13Oct 25, 2019Updated 6 years ago
Nightaway / TinyCompiler
View on GitHub
Tiny语言编译器
☆11Sep 2, 2023Updated 2 years ago
DaRL-LibSignal / OpenTI
View on GitHub
IJMLC: Open-TI: Open Traffic Intelligence with Augmented Language Model
☆23Jul 30, 2025Updated 11 months ago
TARTRL / TiZero
View on GitHub
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
☆14May 25, 2023Updated 3 years ago
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 6 years ago
nlml / MoCo
View on GitHub
Momentum Contrast for Unsupervised Visual Representation Learning
☆16Mar 24, 2023Updated 3 years ago
benediamond / leela-chess
View on GitHub
A chess adaption of GCP's Leela Zero
☆14Jan 9, 2018Updated 8 years ago
georgesung / deep_rl_acrobot
View on GitHub
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
☆35Apr 21, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
cedricpinson / vox2mesh
View on GitHub
Convert .vox to .obj
☆14Nov 24, 2018Updated 7 years ago
che-shr-cat / alphago
View on GitHub
Code to recreate AlphaGo Zero models
☆19Mar 24, 2023Updated 3 years ago
HanHuCAS / FaceIlluminationNormalization
View on GitHub
MATLAB code for "PR2013 - A Comparative Study on Illumination Preprocessing in Face Recognition"
☆13Aug 13, 2016Updated 9 years ago
sak2km / OnlineLearningToRank
View on GitHub
☆13May 11, 2021Updated 5 years ago
elise-ng / COMP4901J_Project
View on GitHub
Mahjong Tile Image Classification with Denoising CAE and CNN
☆14May 15, 2019Updated 7 years ago
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136Jul 17, 2026Updated last week
johnno1962 / ProfileSwiftUI
View on GitHub
InstrumentSwiftUI
☆11Jul 19, 2024Updated 2 years ago