This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
☆41Nov 9, 2025Updated 6 months ago
Alternatives and similar repositories for DreamGym
Users that are interested in DreamGym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆35Nov 1, 2025Updated 6 months ago
- ☆18Mar 2, 2026Updated 2 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆42May 6, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Sep 26, 2024Updated last year
- ☆14May 20, 2022Updated 4 years ago
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- ☆12Jan 21, 2024Updated 2 years ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024)☆18Jul 14, 2025Updated 10 months ago
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆18Mar 18, 2025Updated last year
- ☆12Jul 21, 2025Updated 10 months ago
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆65Feb 18, 2026Updated 3 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆44Mar 20, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Mar 11, 2025Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆38Nov 11, 2025Updated 6 months ago
- Efficiently apply modification functions to RLDS/TFDS datasets.☆32Jun 19, 2024Updated last year
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆23Jan 28, 2025Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆26Apr 21, 2026Updated last month
- [DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup☆35Nov 9, 2025Updated 6 months ago
- ☆15Jul 8, 2024Updated last year
- ☆24Jan 2, 2024Updated 2 years ago
- ☆110Dec 5, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. This experiment hopes to show that the s…☆105Oct 14, 2025Updated 7 months ago
- ☆11Jun 2, 2019Updated 6 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning"☆11Mar 30, 2020Updated 6 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆22Dec 6, 2023Updated 2 years ago
- A comprehensive collection of process reward models.☆152Oct 4, 2025Updated 7 months ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code repository for our paper presenting the L3D dataset.☆24Dec 14, 2021Updated 4 years ago
- 首个全参数训练的知识产权大模型 MoZi (墨子)☆27Aug 20, 2024Updated last year
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆18Jul 21, 2025Updated 10 months ago
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- ☆31Mar 5, 2025Updated last year