Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆65Jun 13, 2024Updated last year
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- POPGym Library in JAX☆13Apr 15, 2024Updated 2 years ago
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Mar 25, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Highly scalable 2D JAX physics engine.☆65Feb 20, 2026Updated last month
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆193Jun 13, 2024Updated last year
- Simple JAX Graphics Library.☆37Nov 3, 2024Updated last year
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆21Oct 21, 2025Updated 5 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- Efficient baselines for autocurricula in JAX.☆211Aug 24, 2024Updated last year
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆241Feb 26, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22May 12, 2025Updated 11 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆75Dec 26, 2024Updated last year
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 5 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆67Feb 12, 2025Updated last year
- Accelerated minigrid environments with JAX☆164Oct 20, 2025Updated 5 months ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 3 weeks ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆19Sep 12, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆93Feb 16, 2026Updated 2 months ago
- Neuroevolution Benchmark in JAX 🦕☆42Nov 5, 2023Updated 2 years ago
- Dependency injection for `typer`☆23Jan 12, 2026Updated 3 months ago
- ☆95Jan 21, 2026Updated 2 months ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆50Jan 1, 2024Updated 2 years ago
- Combining NEAT and novelty search to quickly generate diverse video game levels (GECCO 2022). https://arxiv.org/abs/2204.06934☆16Oct 4, 2022Updated 3 years ago
- Code for the paper Hierarchical WaveFunction Collapse (AIIDE 2023)☆12Mar 24, 2024Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆264Oct 31, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Nov 27, 2023Updated 2 years ago
- ☆122Feb 25, 2025Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆34Sep 18, 2024Updated last year
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆17Feb 15, 2023Updated 3 years ago
- Code for the "Cultural evolution in populations of Large Language Models" paper☆34Mar 10, 2026Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning☆51Mar 2, 2026Updated last month
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated 11 months ago