luchris429 / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆63Updated last year
Alternatives and similar repositories for DiscoPOP
Users who are interested in DiscoPOP are comparing it to the libraries listed below
- CycleQD is a framework for parameter space model merging.☆40Updated 5 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆60Updated 4 months ago
- The official repository of ALE-Bench☆98Updated this week
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆189Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 4 months ago
- ☆27Updated last year
- Code for the "Cultural evolution in populations of Large Language Models" paper☆32Updated 8 months ago
- Plug-and-play PyTorch implementation of the paper "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆30Updated 8 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆50Updated 5 months ago
- Implementation of Dualformer☆17Updated 4 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind☆55Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆103Updated 2 months ago
- A repository for research on medium sized language models.☆77Updated last year
- ☆53Updated last year
- Train, tune, and infer Bamba model☆130Updated last month
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 5 months ago
- GoldFinch and other hybrid transformer components☆46Updated 11 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 3 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆105Updated 9 months ago
- Memoria is a human-inspired memory architecture for neural networks.☆74Updated 8 months ago
- A benchmark for evaluating situated inductive reasoning☆16Updated 6 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- ☆81Updated last year
- The original Shared Recurrent Memory Transformer implementation☆27Updated last month
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆34Updated last year
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆75Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆87Updated 2 weeks ago
- Official repo of paper LM2☆41Updated 5 months ago
- Collection of LLM completions for reasoning-gym task datasets☆26Updated last week