zhihanyang2022 / alpha-zero
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
☆15 Updated 2 years ago
Alternatives and similar repositories for alpha-zero:
Users who are interested in alpha-zero are comparing it to the libraries listed below.
- PyTorch implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated last week
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆36 Updated 3 months ago
- Recursive Least Squares (RLS) with Neural Network for fast learning ☆52 Updated last year
- JAX implementation of Graph Attention Networks ☆13 Updated 2 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback ☆10 Updated this week
- this is for fun, ain't it grand! ☆12 Updated 8 months ago
- ☆26 Updated last year
- Official implementation of E(n)-equivariant Graph Neural Cellular Automata ☆25 Updated 8 months ago
- ☆13 Updated 2 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies" ☆11 Updated 2 years ago
- Understanding RL vision Distill article ☆23 Updated last year
- ☆13 Updated this week
- JAX/Flax implementation of the Hyena Hierarchy ☆33 Updated last year
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation" ☆23 Updated 2 years ago
- Official repository for the paper "Automating Continual Learning" ☆12 Updated 9 months ago
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction ☆31 Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in PyTorch ☆37 Updated 2 years ago
- Toy genetic algorithm in PyTorch ☆30 Updated 10 months ago
- A repository of PyTorch examples ☆10 Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" ☆17 Updated last week
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated 7 months ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from DeepMind ☆17 Updated 7 months ago
- ☆23 Updated 9 months ago