google-deepmind / alphadev
☆727 Updated 2 years ago
Alternatives and similar repositories for alphadev
Users who are interested in alphadev are comparing it to the libraries listed below.
- Monte Carlo tree search in JAX ☆2,587 Updated 5 months ago
- ☆2,812 Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆924 Updated 2 years ago
- Evolution Through Large Models ☆737 Updated 2 years ago
- Convolutions for Sequence Modeling ☆910 Updated last year
- ☆551 Updated 2 years ago
- Reinforcement learning environments for compiler and program optimization tasks ☆992 Updated this week
- ☆1,001 Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX ☆583 Updated 11 months ago
- Really Fast End-to-End Jax RL Implementations ☆1,017 Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆866 Updated last year
- Automatic gradient descent ☆217 Updated 2 years ago
- ☆474 Updated 3 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆981 Updated 2 years ago
- AlphaZero in JAX ☆81 Updated last year
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆775 Updated last year
- ☆540 Updated 3 years ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,860 Updated 7 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind ☆406 Updated 7 months ago
- Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022) ☆499 Updated 9 months ago
- Language Modeling with the H3 State Space Model ☆522 Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆828 Updated 3 years ago
- ☆794 Updated this week
- Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation. ☆1,407 Updated 9 months ago
- Data and code for the paper "A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot … ☆184 Updated 3 years ago
- Code for Parsel 🐍 - generate complex programs with language models ☆439 Updated 2 years ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!) ☆1,299 Updated last year
- Minimal library to train LLMs on TPU in JAX with pjit(). ☆301 Updated 2 years ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h… ☆873 Updated this week
- Unofficial Gato: A Generalist Agent ☆219 Updated 2 years ago