luchris429 / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆61Updated 10 months ago
Alternatives and similar repositories for DiscoPOP:
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- ☆26Updated 11 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆45Updated 3 months ago
- CycleQD is a framework for parameter space model merging.☆39Updated 3 months ago
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆30Updated 5 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆55Updated 2 months ago
- ☆65Updated 3 weeks ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆41Updated 3 months ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆186Updated 10 months ago
- ☆43Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆98Updated 7 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated last month
- Collection of LLM completions for reasoning-gym task datasets☆19Updated this week
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆42Updated 2 weeks ago
- ☆49Updated last year
- ☆64Updated 10 months ago
- ☆19Updated last week
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆17Updated 2 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆123Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models☆24Updated last month
- Train, tune, and infer Bamba model☆115Updated last week
- Learn online intrinsic rewards from LLM feedback☆37Updated 4 months ago
- σ-GPT: A New Approach to Autoregressive Models☆63Updated 8 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- Code for the "Cultural evolution in populations of Large Language Models" paper☆32Updated 6 months ago
- ☆22Updated 6 months ago
- ☆91Updated 10 months ago