This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
☆39Nov 9, 2025Updated 4 months ago
Alternatives and similar repositories for DreamGym
Users that are interested in DreamGym are comparing it to the libraries listed below
Sorting:
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated 11 months ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Analysis of evidential models☆15Jun 22, 2023Updated 2 years ago
- ☆13Aug 12, 2022Updated 3 years ago
- ☆19Nov 5, 2024Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated last month
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 11 months ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024)☆16Jul 14, 2025Updated 8 months ago
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- ☆12Jan 21, 2024Updated 2 years ago
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆60Feb 18, 2026Updated last month
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 10 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 4 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆44Mar 20, 2024Updated 2 years ago
- Efficiently apply modification functions to RLDS/TFDS datasets.☆29Jun 19, 2024Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆36Nov 11, 2025Updated 4 months ago
- ☆14Mar 11, 2025Updated last year
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆17Jan 28, 2025Updated last year
- ☆36Feb 11, 2025Updated last year
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆19May 28, 2024Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆23Mar 8, 2026Updated 2 weeks ago
- The simplest repository for training medium-sized BackpackLM for cs224n☆25Aug 13, 2023Updated 2 years ago
- ☆14Jul 8, 2024Updated last year
- [DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup☆35Nov 9, 2025Updated 4 months ago
- A LLaMA1/LLaMA12 Megatron implement.☆28Dec 13, 2023Updated 2 years ago
- ☆23Jan 2, 2024Updated 2 years ago
- An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. This experiment hopes to show that the s…☆100Oct 14, 2025Updated 5 months ago
- ☆106Dec 5, 2025Updated 3 months ago
- ☆19Sep 11, 2024Updated last year
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- SyGra - Graph-oriented Synthetic data generation Pipeline☆75Updated this week
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- ☆41May 22, 2025Updated 9 months ago
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆20Dec 6, 2023Updated 2 years ago
- Assisting library for the ML4CV tutorial based on scikit-learn.☆35Nov 2, 2020Updated 5 years ago
- A comprehensive collection of process reward models.☆141Oct 4, 2025Updated 5 months ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago