google-deepmind / alphadev
☆718 · Updated 2 years ago
Alternatives and similar repositories for alphadev
Users that are interested in alphadev are comparing it to the libraries listed below
- ☆2,788 · Updated last year
- Monte Carlo tree search in JAX ☆2,538 · Updated 3 weeks ago
- Evolution Through Large Models ☆733 · Updated last year
- ☆933 · Updated last year
- ☆546 · Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆911 · Updated last year
- Reinforcement learning environments for compiler and program optimization tasks ☆973 · Updated 11 months ago
- Convolutions for Sequence Modeling ☆898 · Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆841 · Updated 11 months ago
- ☆916 · Updated last year
- ☆427 · Updated 2 months ago
- Automatic gradient descent ☆210 · Updated 2 years ago
- Really Fast End-to-End Jax RL Implementations ☆953 · Updated last year
- ♟️ Vectorized RL game environments in JAX ☆524 · Updated 6 months ago
- ☆501 · Updated 3 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆771 · Updated 11 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind ☆399 · Updated 3 months ago
- Code for Parsel 🐍 - generate complex programs with language models ☆432 · Updated 2 years ago
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service. ☆1,602 · Updated last week
- Language Modeling with the H3 State Space Model ☆518 · Updated last year
- Alex Krizhevsky's original code from Google Code ☆198 · Updated 9 years ago
- ☆866 · Updated last year
- Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation. ☆1,387 · Updated 5 months ago
- AlphaZero in JAX ☆78 · Updated last year
- Minimal library to train LLMs on TPU in JAX with pjit(). ☆301 · Updated last year
- Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022) ☆490 · Updated 4 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,821 · Updated 3 months ago
- fast + parallel AlphaZero in JAX ☆101 · Updated 9 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆632 · Updated last month
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆970 · Updated last year