google-deepmind / alphadev
☆699Updated last year
Alternatives and similar repositories for alphadev:
Users that are interested in alphadev are comparing it to the libraries listed below
- ☆2,708Updated 8 months ago
- Monte Carlo tree search in JAX☆2,411Updated last month
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆878Updated last year
- ☆752Updated 11 months ago
- Evolution Through Large Models☆709Updated last year
- Convolutions for Sequence Modeling☆875Updated 7 months ago
- If tinygrad wasn't small enough for you...☆673Updated 10 months ago
- ☆2,113Updated last year
- Reinforcement learning environments for compiler and program optimization tasks☆928Updated 3 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,309Updated 7 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,717Updated last month
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆818Updated 3 months ago
- Data and code for the paper "A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot …☆185Updated 2 years ago
- ☆514Updated 11 months ago
- maximal update parametrization (µP)☆1,430Updated 6 months ago
- MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs☆888Updated last year
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆2,854Updated this week
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆629Updated last year
- Really Fast End-to-End Jax RL Implementations☆797Updated 4 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,441Updated 5 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆504Updated last year
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,240Updated last month
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,550Updated 11 months ago
- Language Modeling with the H3 State Space Model☆516Updated last year
- ☆863Updated 6 months ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,115Updated 5 months ago
- High throughput synchronous and asynchronous reinforcement learning☆857Updated 3 weeks ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,461Updated 8 months ago
- Mastering Diverse Domains through World Models☆1,456Updated last week
- ☆337Updated this week