google-deepmind / alphadevLinks
☆728Updated 2 years ago
Alternatives and similar repositories for alphadev
Users that are interested in alphadev are comparing it to the libraries listed below
Sorting:
- Monte Carlo tree search in JAX☆2,579Updated 4 months ago
- ☆2,799Updated last year
- Evolution Through Large Models☆736Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆921Updated 2 years ago
- ☆550Updated last year
- ☆991Updated last year
- AlphaZero in JAX☆81Updated last year
- ☆530Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX☆573Updated 10 months ago
- Reinforcement learning environments for compiler and program optimization tasks☆985Updated last year
- Code for Parsel 🐍 - generate complex programs with language models☆439Updated 2 years ago
- Convolutions for Sequence Modeling☆909Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆861Updated last year
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆167Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆355Updated last year
- Really Fast End-to-End Jax RL Implementations☆1,010Updated last year
- Diffusion on syntax trees for program synthesis☆480Updated last year
- fast + parallel AlphaZero in JAX☆108Updated last year
- Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.☆1,406Updated 8 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆405Updated 6 months ago
- ☆1,064Updated last year
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆667Updated 4 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h…☆862Updated this week
- Minimal library to train LLMs on TPU in JAX with pjit().☆301Updated 2 years ago
- Alex Krizhevsky's original code from Google Code☆198Updated 9 years ago
- Automatic gradient descent☆216Updated 2 years ago
- Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)☆497Updated 8 months ago
- ☆138Updated last year
- Cost aware hyperparameter tuning algorithm☆177Updated last year