This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
☆40Nov 9, 2025Updated 5 months ago
Alternatives and similar repositories for DreamGym
Users that are interested in DreamGym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆35Nov 1, 2025Updated 6 months ago
- ☆18Mar 2, 2026Updated last month
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆27Aug 16, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Self-Questioning Language Models☆56Mar 30, 2026Updated last month
- ☆19Nov 5, 2024Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆41Apr 7, 2026Updated 3 weeks ago
- ☆12Dec 4, 2021Updated 4 years ago
- ☆13Sep 26, 2024Updated last year
- ☆14May 20, 2022Updated 3 years ago
- ☆12Jan 21, 2024Updated 2 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆23Feb 1, 2025Updated last year
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆18Mar 18, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆63Feb 18, 2026Updated 2 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated 11 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆44Mar 20, 2024Updated 2 years ago
- Removing rain from single images via a deep detail network (achieved by others)☆14Mar 28, 2018Updated 8 years ago
- ☆13Mar 11, 2025Updated last year
- Efficiently apply modification functions to RLDS/TFDS datasets.☆31Jun 19, 2024Updated last year
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆20Jan 28, 2025Updated last year
- GOP3 Blackjack Bot☆27Sep 26, 2025Updated 7 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆20May 28, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Dec 10, 2021Updated 4 years ago
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆26Apr 21, 2026Updated last week
- A LLaMA1/LLaMA12 Megatron implement.☆28Dec 13, 2023Updated 2 years ago
- ☆24Jan 2, 2024Updated 2 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- This is a framework for evaluating reasoning in foundational Video Models.☆89Apr 16, 2026Updated 2 weeks ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- 首个全参数训练的知识产权大模型 MoZi (墨子)☆26Aug 20, 2024Updated last year
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆22Dec 6, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Assisting library for the ML4CV tutorial based on scikit-learn.☆35Nov 2, 2020Updated 5 years ago
- ☆32Jan 27, 2022Updated 4 years ago
- 公开的知识图谱探索项目☆14Jul 9, 2020Updated 5 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- Code repository for our paper presenting the L3D dataset.☆24Dec 14, 2021Updated 4 years ago
- ☆10Jan 28, 2024Updated 2 years ago
- Data augmentation using OpenCV☆11Jan 12, 2017Updated 9 years ago