google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆572Updated 3 months ago
Alternatives and similar repositories for searchless_chess:
Users who are interested in searchless_chess are comparing it to the libraries listed below:
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 5 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆607Updated last month
- ☆241Updated last year
- Teaching transformers to play chess☆121Updated 3 months ago
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".☆365Updated 10 months ago
- ☆141Updated last week
- Open weights language model from Google DeepMind, based on Griffin.☆636Updated 2 months ago
- Absolutely minimalistic implementation of a GPT-like transformer using only NumPy (<650 lines).☆251Updated last year
- PyTorch script hot swap: change code without unloading your LLM from VRAM☆123Updated 2 weeks ago
- OpenPipe ART (Agent Reinforcement Trainer): train LLM agents☆318Updated this week
- Animating R1's thoughts.☆380Updated 2 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆202Updated 7 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆411Updated last week
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆261Updated last week
- Code behind arXiv papers☆515Updated last year
- A pure NumPy implementation of Mamba.☆222Updated 9 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆283Updated this week
- Implement Llama 3 inference step by step: grasp the core concepts, work through the derivation, and write the code.☆571Updated 2 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,769Updated last week
- A BERT that you can train on a (gaming) laptop.☆208Updated last year
- ☆159Updated last month
- Benchmark LLM reasoning capability by solving chess puzzles.☆77Updated last week
- Diffusion on syntax trees for program synthesis☆456Updated 10 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆305Updated 6 months ago
- Our solution for the ARC challenge 2024☆135Updated 2 months ago
- Felafax is building AI infra for non-NVIDIA GPUs☆559Updated 3 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs.☆559Updated 6 months ago
- Run and explore Llama models locally with minimal dependencies on CPU☆189Updated 6 months ago
- A browser-based, WebGL2 implementation of GPT-2 with transformer block and attention matrix visualization☆217Updated this week
- Examples and guides for using the VLM Run API☆275Updated this week