Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆196 · Jun 13, 2024 · Updated last year
Alternatives and similar repositories for DiscoPOP
Users who are interested in DiscoPOP are comparing it to the libraries listed below.
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆35 · Mar 21, 2024 · Updated 2 years ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆65 · Jun 13, 2024 · Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,420 · Nov 29, 2024 · Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆357 · Oct 22, 2024 · Updated last year
- Your favourite classical machine learning algos on the GPU/TPU ☆22 · Dec 14, 2025 · Updated 5 months ago
- Efficient baselines for autocurricula in JAX. ☆213 · Aug 24, 2024 · Updated last year
- Open-source Human Feedback Library ☆11 · Oct 25, 2023 · Updated 2 years ago
- ☆16 · Jul 16, 2024 · Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆205 · Jul 17, 2024 · Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 · Jan 23, 2025 · Updated last year
- ☆145 · Aug 20, 2025 · Updated 9 months ago
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization ☆23 · Dec 1, 2025 · Updated 5 months ago
- ☆42 · Apr 27, 2022 · Updated 4 years ago
- ☆12 · Jul 6, 2024 · Updated last year
- Automating the Search for Artificial Life with Foundation Models! ☆470 · Oct 23, 2025 · Updated 7 months ago
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction ☆21 · May 29, 2024 · Updated last year
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,212 · Jan 30, 2025 · Updated last year
- Japanese Language Model Financial Evaluation Harness ☆77 · Feb 18, 2026 · Updated 3 months ago
- A tiny server to run local inference on MLX model in the style of OpenAI ☆13 · Jan 31, 2024 · Updated 2 years ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆156 · Apr 7, 2025 · Updated last year
- ☆11 · Feb 9, 2024 · Updated 2 years ago
- Generative cellular automaton-like learning environments for RL. ☆20 · Jan 30, 2025 · Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆98 · Nov 17, 2024 · Updated last year
- Scalable Opponent Shaping Experiments in JAX ☆26 · Apr 13, 2024 · Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆22 · Jun 19, 2024 · Updated last year
- Neuroevolution Benchmark in JAX 🦕 ☆43 · Nov 5, 2023 · Updated 2 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆144 · Apr 28, 2026 · Updated 3 weeks ago
- The PyTorch Library for LLM Applications. ☆16 · Jul 16, 2024 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- ☆138 · Aug 19, 2024 · Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆115 · Dec 5, 2023 · Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 7 months ago
- For releasing code related to compression methods for transformers, accompanying our publications ☆462 · Jan 16, 2025 · Updated last year
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬 ☆13,730 · Dec 19, 2025 · Updated 5 months ago
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- A byte-level decoder architecture that matches the performance of tokenized Transformers. ☆68 · Apr 24, 2024 · Updated 2 years ago
- Awesome Open-ended AI ☆438 · May 17, 2026 · Updated last week
- ☆12 · Apr 19, 2024 · Updated 2 years ago