zhihanyang2022 / alpha-zero
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
☆16 Updated 2 years ago
Alternatives and similar repositories for alpha-zero
Users who are interested in alpha-zero are comparing it to the repositories listed below:
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search ☆27 Updated 6 years ago
- Efficiently send large arrays across machines ☆16 Updated 11 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 Updated this week
- Adversarial examples to the new ConvNeXt architecture ☆20 Updated 3 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆36 Updated 9 months ago
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal] ☆10 Updated last year
- Load any clip model with a standardized interface ☆21 Updated last year
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst… ☆17 Updated 2 years ago
- ☆13 Updated 10 months ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- JAX implementation of Graph Attention Networks ☆13 Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago
- ☆26 Updated 2 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆25 Updated 5 months ago
- Understanding RL vision Distill article ☆23 Updated 2 years ago
- Directed masked autoencoders ☆14 Updated 2 years ago
- A regression-alike loss to improve numerical reasoning in language models ☆17 Updated 3 weeks ago
- ☆23 Updated 6 months ago
- A dashboard for exploring timm learning rate schedulers ☆19 Updated 7 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆50 Updated 3 years ago
- This repo consists of all my RL work and learnings ☆11 Updated 3 years ago
- implementation of dualformer ☆17 Updated 3 months ago
- Pytorch implementation of StyleGAN2 in my style ☆11 Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated 2 years ago
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind ☆19 Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆25 Updated 3 years ago
- ☆18 Updated last year
- ☆12 Updated 3 years ago
- Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024) ☆17 Updated 10 months ago