Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆192 · Jun 13, 2024 · Updated last year
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆65 · Jun 13, 2024 · Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,403 · Nov 29, 2024 · Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆349 · Oct 22, 2024 · Updated last year
- Efficient baselines for autocurricula in JAX. ☆208 · Aug 24, 2024 · Updated last year
- ☆12 · Jul 6, 2024 · Updated last year
- Your favourite classical machine learning algos on the GPU/TPU ☆22 · Dec 14, 2025 · Updated 2 months ago
- ☆41 · Apr 27, 2022 · Updated 3 years ago
- ☆43 · Sep 19, 2024 · Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆203 · Jul 17, 2024 · Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models ☆16 · May 4, 2024 · Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆92 · Jan 23, 2025 · Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight) ☆67 · Dec 18, 2023 · Updated 2 years ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,189 · Jan 30, 2025 · Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆23 · May 6, 2025 · Updated 9 months ago
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization ☆20 · Dec 1, 2025 · Updated 3 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆73 · Aug 14, 2024 · Updated last year
- ☆142 · Aug 20, 2025 · Updated 6 months ago
- Generative cellular automaton-like learning environments for RL. ☆20 · Jan 30, 2025 · Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆137 · Jan 31, 2026 · Updated last month
- Learn how to use logit bias with OpenAI models to create highly-powerful classifiers in minutes. ☆34 · Jun 21, 2023 · Updated 2 years ago
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction ☆22 · May 29, 2024 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- Automating the Search for Artificial Life with Foundation Models! ☆450 · Oct 23, 2025 · Updated 4 months ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆156 · Apr 7, 2025 · Updated 10 months ago
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- ☆67 · Mar 30, 2025 · Updated 11 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆94 · Nov 17, 2024 · Updated last year
- The official implementation of Self-Exploring Language Models (SELM) ☆63 · Jun 4, 2024 · Updated last year
- Open-source Human Feedback Library ☆11 · Oct 25, 2023 · Updated 2 years ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬 ☆12,216 · Dec 19, 2025 · Updated 2 months ago
- ☆62 · Dec 8, 2023 · Updated 2 years ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers. ☆67 · Apr 24, 2024 · Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆146 · Oct 19, 2024 · Updated last year
- For releasing code related to compression methods for transformers, accompanying our publications ☆454 · Jan 16, 2025 · Updated last year
- ☆138 · Aug 19, 2024 · Updated last year
- ☆1,033 · Dec 17, 2024 · Updated last year
- ☆130 · Oct 1, 2024 · Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆176 · Jan 16, 2025 · Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆234 · Jul 19, 2025 · Updated 7 months ago