google-deepmind / funsearch
☆770 Updated last year
Alternatives and similar repositories for funsearch:
Users who are interested in funsearch are comparing it to the libraries listed below.
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,117 Updated 9 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,361 Updated 10 months ago
- ☆2,521 Updated 9 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,278 Updated 2 months ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,498 Updated 3 months ago
- Training LLMs with QLoRA + FSDP ☆1,451 Updated 3 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆688 Updated 5 months ago
- official code for "Large Language Models as Optimizers" ☆498 Updated 2 months ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs. ☆235 Updated 11 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆973 Updated 6 months ago
- Code for Quiet-STaR ☆713 Updated 6 months ago
- ☆718 Updated 8 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,446 Updated 11 months ago
- LLM verified with Monte Carlo Tree Search ☆264 Updated 2 weeks ago
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta. ☆2,766 Updated last week
- End-to-end Generative Optimization for AI Agents ☆477 Updated this week
- Beyond Language Models: Byte Models are Digital World Simulators ☆318 Updated 8 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆1,031 Updated 11 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆803 Updated last week
- Convolutions for Sequence Modeling ☆876 Updated 8 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency ☆789 Updated 6 months ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models" ☆730 Updated 6 months ago
- Reference implementation of Megalodon 7B model ☆516 Updated 10 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,155 Updated 3 months ago
- A simple, performant and scalable Jax LLM! ☆1,623 Updated this week
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture" ☆547 Updated last month
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333 ☆1,083 Updated last year
- ☆499 Updated 6 months ago
- LLMs as Copilots for Theorem Proving in Lean ☆1,040 Updated 2 weeks ago