luchris429 / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆65 · Jun 13, 2024 · Updated last year
Alternatives and similar repositories for DiscoPOP
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- POPGym Library in JAX ☆12 · Apr 15, 2024 · Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar… ☆14 · Nov 4, 2025 · Updated 3 months ago
- ☆28 · Apr 22, 2025 · Updated 9 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 · Dec 5, 2023 · Updated 2 years ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆192 · Jun 13, 2024 · Updated last year
- Scalable Opponent Shaping Experiments in JAX ☆25 · Apr 13, 2024 · Updated last year
- OpenAI's Code Interpreter running locally, as a service via WebSocket ☆10 · Sep 22, 2023 · Updated 2 years ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization. ☆12 · Nov 4, 2025 · Updated 3 months ago
- Code for the paper: Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics ☆13 · Aug 9, 2024 · Updated last year
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation ☆14 · May 10, 2025 · Updated 9 months ago
- ☆16 · Jul 16, 2024 · Updated last year
- An Open-Ended Agentic Simulator ☆58 · Aug 11, 2024 · Updated last year
- Efficient baselines for autocurricula in JAX. ☆206 · Aug 24, 2024 · Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024 ☆20 · Oct 21, 2025 · Updated 3 months ago
- ☆15 · Jun 22, 2022 · Updated 3 years ago
- Generative cellular automaton-like learning environments for RL. ☆20 · Jan 30, 2025 · Updated last year
- ☆19 · Oct 12, 2016 · Updated 9 years ago
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award) ☆17 · Feb 15, 2023 · Updated 3 years ago
- A platform for transparently publishing the flow of household-ledger funds ☆180 · Oct 11, 2025 · Updated 4 months ago
- ☆90 · Nov 3, 2024 · Updated last year
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 · Jun 19, 2024 · Updated last year
- ☆26 · Mar 11, 2025 · Updated 11 months ago
- A package for Safe Anytime Valid Inference ☆26 · Nov 4, 2024 · Updated last year
- ☆17 · Jul 3, 2017 · Updated 8 years ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral. ☆229 · Jan 24, 2026 · Updated 3 weeks ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆94 · Nov 17, 2024 · Updated last year
- ☆23 · Oct 21, 2024 · Updated last year
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners ☆26 · Dec 11, 2024 · Updated last year
- Gogoanime and Anilist scraper with free hosting on Cloudflare, with tutorial. ☆14 · Mar 2, 2025 · Updated 11 months ago
- Process Orchestration Framework: A camunda 7 fork ☆20 · Updated this week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆30 · Dec 15, 2025 · Updated last month
- Your favourite classical machine learning algos on the GPU/TPU ☆21 · Dec 14, 2025 · Updated 2 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration ☆66 · Feb 12, 2025 · Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- ☆29 · Jun 24, 2024 · Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆73 · Dec 26, 2024 · Updated last year
- Structured outputs from DSPy and Jinja2 ☆27 · Jun 27, 2025 · Updated 7 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax! ☆260 · Oct 31, 2025 · Updated 3 months ago