zhihanyang2022 / alpha-zeroLinks
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
☆16Updated 3 years ago
Alternatives and similar repositories for alpha-zero
Users that are interested in alpha-zero are comparing it to the libraries listed below
Sorting:
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- Toy genetic algorithm in Pytorch☆53Updated 4 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Updated last week
- Minimum Description Length probing for neural network representations☆18Updated 7 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Updated 2 years ago
- Understanding RL vision Distill article☆24Updated 2 years ago
- Simple illustrative examples for energy-based models in PyTorch☆66Updated 5 years ago
- Implementations of growing and pruning in neural networks☆22Updated 2 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Updated 3 years ago
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- ☆31Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner☆27Updated 3 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Updated last year
- ☆66Updated 5 months ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Fast reinforcement learning 💨☆26Updated last month
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆17Updated 10 months ago
- ☆26Updated 2 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Updated 3 years ago
- A simple implementation of a deep linear Pytorch module☆21Updated 4 years ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆17Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆77Updated last year
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 5 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆35Updated 4 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 11 months ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆27Updated 6 years ago