google-deepmind / alphadevLinks
☆716Updated 2 years ago
Alternatives and similar repositories for alphadev
Users that are interested in alphadev are comparing it to the libraries listed below
Sorting:
- ☆2,778Updated last year
- Monte Carlo tree search in JAX☆2,523Updated 4 months ago
- Evolution Through Large Models☆731Updated last year
- Convolutions for Sequence Modeling☆895Updated last year
- Code for Parsel 🐍 - generate complex programs with language models☆432Updated last year
- ☆544Updated last year
- Reinforcement learning environments for compiler and program optimization tasks☆965Updated 10 months ago
- ☆923Updated last year
- Automatic gradient descent☆210Updated 2 years ago
- Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.☆1,380Updated 4 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆906Updated last year
- Language Modeling with the H3 State Space Model☆519Updated last year
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,593Updated this week
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆839Updated 10 months ago
- Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)☆488Updated 3 months ago
- ☆491Updated 2 years ago
- AlphaZero in JAX☆78Updated last year
- Data and code for the paper "A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot …☆184Updated 2 years ago
- Tool for data extraction and interacting with Lean programmatically.☆701Updated 2 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆394Updated 2 months ago
- ☆912Updated last year
- ☆783Updated 2 months ago
- ♟️ Vectorized RL game environments in JAX☆517Updated 5 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆541Updated 7 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆967Updated last year
- Alex Krizhevsky's original code from Google Code☆196Updated 9 years ago
- Diffusion on syntax trees for program synthesis☆472Updated last year
- fast + parallel AlphaZero in JAX☆96Updated 8 months ago
- ☆423Updated last month
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆151Updated 10 months ago