google-deepmind / funsearchLinks
☆880Updated last year
Alternatives and similar repositories for funsearch
Users that are interested in funsearch are comparing it to the libraries listed below
Sorting:
- ☆2,525Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,387Updated last year
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,565Updated 7 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,329Updated 6 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,159Updated last year
- Code for Quiet-STaR☆732Updated 9 months ago
- ☆709Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆981Updated 10 months ago
- ☆556Updated last month
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆272Updated last year
- A bibliography and survey of the papers surrounding o1☆1,194Updated 6 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,071Updated last year
- Training LLMs with QLoRA + FSDP☆1,483Updated 6 months ago
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.☆2,838Updated last week
- ☆864Updated last year
- Evolution Through Large Models☆722Updated last year
- A simple, performant and scalable Jax LLM!☆1,746Updated this week
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,108Updated last year
- Our solution for the arc challenge 2024☆144Updated 3 months ago
- ☆1,024Updated 5 months ago
- LLMs as Copilots for Theorem Proving in Lean☆1,094Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,784Updated last month
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,001Updated 2 years ago
- Open weights language model from Google DeepMind, based on Griffin.☆640Updated this week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆311Updated 6 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆550Updated 5 months ago
- ☆4,506Updated 7 months ago
- Fine-tune LLM agents with online reinforcement learning☆1,191Updated last year
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆851Updated last week
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆244Updated last year