Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆65 · Jun 13, 2024 · Updated last year
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- POPGym Library in JAX ☆12 · Apr 15, 2024 · Updated last year
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar… ☆14 · Nov 4, 2025 · Updated 4 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- ☆29 · Apr 22, 2025 · Updated 10 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 · Dec 5, 2023 · Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆25 · Apr 13, 2024 · Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization. ☆12 · Nov 4, 2025 · Updated 4 months ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation ☆14 · May 10, 2025 · Updated 9 months ago
- Code for the paper: Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics ☆14 · Aug 9, 2024 · Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval” ☆12 · Jun 3, 2024 · Updated last year
- ☆16 · Jul 16, 2024 · Updated last year
- An Open-Ended Agentic Simulator ☆60 · Aug 11, 2024 · Updated last year
- Efficient baselines for autocurricula in JAX. ☆208 · Aug 24, 2024 · Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024 ☆20 · Oct 21, 2025 · Updated 4 months ago
- Dependency injection for `typer` ☆22 · Jan 12, 2026 · Updated last month
- ☆14 · May 21, 2024 · Updated last year
- ☆17 · Dec 23, 2024 · Updated last year
- ☆15 · Jun 22, 2022 · Updated 3 years ago
- MoCo: A One-Stop Shop for Model Collaboration Research ☆48 · Feb 24, 2026 · Updated last week
- Generative cellular automaton-like learning environments for RL. ☆20 · Jan 30, 2025 · Updated last year
- Official Codes ☆19 · Oct 26, 2023 · Updated 2 years ago
- High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma… ☆20 · Jun 19, 2025 · Updated 8 months ago
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award) ☆17 · Feb 15, 2023 · Updated 3 years ago
- ☆92 · Feb 16, 2026 · Updated 2 weeks ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 · Jun 19, 2024 · Updated last year
- ☆26 · Mar 11, 2025 · Updated 11 months ago
- ☆22 · May 12, 2025 · Updated 9 months ago
- ☆17 · Jul 3, 2017 · Updated 8 years ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral. ☆233 · Feb 26, 2026 · Updated last week
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆94 · Nov 17, 2024 · Updated last year
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners ☆26 · Dec 11, 2024 · Updated last year
- ☆93 · Jan 21, 2026 · Updated last month
- Process Orchestration Framework: A camunda 7 fork ☆21 · Updated this week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆30 · Dec 15, 2025 · Updated 2 months ago
- Accelerated minigrid environments with JAX ☆160 · Oct 20, 2025 · Updated 4 months ago
- PromptMII: Meta-Learning Instruction Induction for LLMs ☆47 · Jan 12, 2026 · Updated last month
- Reactive DDD with DSPy ☆23 · Feb 24, 2024 · Updated 2 years ago
- Your favourite classical machine learning algos on the GPU/TPU ☆22 · Dec 14, 2025 · Updated 2 months ago
- Highly scalable 2D JAX physics engine. ☆63 · Feb 20, 2026 · Updated 2 weeks ago