Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆197Jun 13, 2024Updated 2 years ago
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆35Mar 21, 2024Updated 2 years ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆64Jun 13, 2024Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,425Nov 29, 2024Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆358Oct 22, 2024Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆23Dec 14, 2025Updated 6 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Efficient baselines for autocurricula in JAX.☆213Aug 24, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆16Jul 16, 2024Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆205Jul 17, 2024Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- ☆146Aug 20, 2025Updated 9 months ago
- ☆41Apr 27, 2022Updated 4 years ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated 2 years ago
- Automating the Search for Artificial Life with Foundation Models!☆473Oct 23, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆21May 29, 2024Updated 2 years ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,215Jan 30, 2025Updated last year
- ☆47Mar 30, 2026Updated 2 months ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated last year
- Review by codex, develop by claude-code☆62Apr 25, 2026Updated last month
- ☆12Sep 1, 2025Updated 9 months ago
- ☆11Feb 9, 2024Updated 2 years ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆98Nov 17, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scalable Opponent Shaping Experiments in JAX☆27Apr 13, 2024Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- ☆44Sep 19, 2024Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆146Apr 28, 2026Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- ☆138Aug 19, 2024Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆115Dec 5, 2023Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 8 months ago
- For releasing code related to compression methods for transformers, accompanying our publications☆463Jan 16, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- AMeThyst: Art Metrics Tools for hypothesis test☆26May 13, 2024Updated 2 years ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆68Apr 24, 2024Updated 2 years ago
- Awesome Open-ended AI☆446May 17, 2026Updated 3 weeks ago
- ☆12Apr 19, 2024Updated 2 years ago
- ☆14Dec 26, 2023Updated 2 years ago
- ☆22Dec 19, 2023Updated 2 years ago