google-deepmind / alphadev
☆692Updated last year
Related projects ⓘ
Alternatives and complementary repositories for alphadev
- Monte Carlo tree search in JAX☆2,357Updated 3 months ago
- ☆2,688Updated 6 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆870Updated 11 months ago
- ☆741Updated 9 months ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆804Updated last month
- Really Fast End-to-End Jax RL Implementations☆724Updated 2 months ago
- ☆318Updated this week
- LLMs as Copilots for Theorem Proving in Lean☆998Updated 2 weeks ago
- If tinygrad wasn't small enough for you...☆654Updated 8 months ago
- ☆845Updated 4 months ago
- Reinforcement learning environments for compiler and program optimization tasks☆914Updated last month
- AlphaZero in JAX☆69Updated 7 months ago
- Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)☆464Updated 3 months ago
- Closed-form Continuous-time Neural Networks☆902Updated 4 months ago
- ♟️ Vectorized RL game environments in JAX☆412Updated this week
- Advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.☆1,016Updated this week
- maximal update parametrization (µP)☆1,402Updated 4 months ago
- Tool for data extraction and interacting with Lean programmatically.☆573Updated last month
- Compositional Differentiable Programming Library☆981Updated last week
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,667Updated last year
- Pytorch Implementation of MuZero☆343Updated last year
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,481Updated this week
- ☆417Updated 2 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,095Updated 3 months ago
- ☆367Updated 2 years ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.☆592Updated this week
- Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research☆708Updated 2 weeks ago
- ☆1,263Updated last month
- Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.☆1,304Updated last year
- Data and code for the paper "A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot …☆183Updated last year