Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆65Jun 13, 2024Updated last year
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- POPGym Library in JAX☆12Apr 15, 2024Updated last year
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Nov 4, 2025Updated 4 months ago
- ☆16Jul 16, 2024Updated last year
- Highly scalable 2D JAX physics engine.☆64Feb 20, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆193Jun 13, 2024Updated last year
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- Efficient baselines for autocurricula in JAX.☆209Aug 24, 2024Updated last year
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆236Feb 26, 2026Updated last month
- ☆22May 12, 2025Updated 10 months ago
- Reinforcement Learning inside a 3D soccer simulation☆37Sep 15, 2024Updated last year
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 4 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆67Feb 12, 2025Updated last year
- Accelerated minigrid environments with JAX☆163Oct 20, 2025Updated 5 months ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Nov 4, 2025Updated 4 months ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆19Sep 12, 2025Updated 6 months ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- Dependency injection for `typer`☆23Jan 12, 2026Updated 2 months ago
- MoCo: A One-Stop Shop for Model Collaboration Research☆51Feb 24, 2026Updated last month
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆50Jan 1, 2024Updated 2 years ago
- Combining NEAT and novelty search to quickly generate diverse video game levels (GECCO 2022). https://arxiv.org/abs/2204.06934☆16Oct 4, 2022Updated 3 years ago
- Code for the paper Hierarchical WaveFunction Collapse (AIIDE 2023)☆12Mar 24, 2024Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆265Oct 31, 2025Updated 4 months ago
- ☆10Nov 27, 2023Updated 2 years ago
- ☆122Feb 25, 2025Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆34Sep 18, 2024Updated last year
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆17Feb 15, 2023Updated 3 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the "Cultural evolution in populations of Large Language Models" paper☆34Mar 10, 2026Updated 2 weeks ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆50Mar 2, 2026Updated 3 weeks ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated 10 months ago
- World-Gymnast: Training Robots with Reinforcement Learning in a World Model☆30Feb 11, 2026Updated last month
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆17Jul 31, 2024Updated last year
- Official implementation of Categorical Flow Maps on text.☆48Feb 16, 2026Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago