This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
☆43 · Nov 9, 2025 · Updated 7 months ago
Alternatives and similar repositories for DreamGym
Users that are interested in DreamGym are comparing it to the libraries listed below.
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL" ☆35 · Nov 1, 2025 · Updated 7 months ago
- ☆18 · Mar 2, 2026 · Updated 3 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆19 · Apr 1, 2025 · Updated last year
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model ☆13 · Feb 11, 2025 · Updated last year
- ☆13 · Aug 12, 2022 · Updated 3 years ago
- A complete Undertale Mod Tool for Android ☆23 · Updated this week
- ☆20 · Nov 5, 2024 · Updated last year
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability ☆40 · Feb 23, 2026 · Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆42 · May 6, 2026 · Updated last month
- ☆13 · Sep 26, 2024 · Updated last year
- [SIGIR '25] This is the code repo for our SIGIR '25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G… ☆19 · Apr 22, 2025 · Updated last year
- ☆14 · May 20, 2022 · Updated 4 years ago
- Explanation of the llama2 repo. ☆13 · Jul 18, 2024 · Updated last year
- Perplexity style AI answer engine for AI PCs with CPU, GPU and NPU support ☆50 · Mar 1, 2026 · Updated 3 months ago
- ☆13 · Jan 21, 2024 · Updated 2 years ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024) ☆18 · Jul 14, 2025 · Updated 11 months ago
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You ☆18 · Mar 18, 2025 · Updated last year
- An exploration of artificial intelligence, with the help of math, history and Python ☆18 · Nov 30, 2017 · Updated 8 years ago
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training" ☆65 · Feb 18, 2026 · Updated 4 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆21 · May 15, 2025 · Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning ☆45 · Mar 20, 2024 · Updated 2 years ago
- Implements the Messenger environment and EMMA model. ☆25 · Jun 14, 2023 · Updated 3 years ago
- ☆13 · Mar 11, 2025 · Updated last year
- Efficiently apply modification functions to RLDS/TFDS datasets. ☆32 · Jun 19, 2024 · Updated 2 years ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners ☆20 · May 28, 2024 · Updated 2 years ago
- ☆37 · Feb 11, 2025 · Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C… ☆29 · Apr 21, 2026 · Updated 2 months ago
- [DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup ☆37 · Nov 9, 2025 · Updated 7 months ago
- ☆15 · Jul 8, 2024 · Updated last year
- ☆24 · Jan 2, 2024 · Updated 2 years ago
- An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. This experiment hopes to show that the s… ☆107 · Oct 14, 2025 · Updated 8 months ago
- ☆10 · Feb 27, 2020 · Updated 6 years ago
- ☆11 · Jun 2, 2019 · Updated 7 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh… ☆11 · Nov 22, 2022 · Updated 3 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning" ☆11 · Mar 30, 2020 · Updated 6 years ago
- Bayesian scaling laws for in-context learning. ☆16 · Mar 12, 2025 · Updated last year
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l… ☆23 · Dec 6, 2023 · Updated 2 years ago
- Assisting library for the ML4CV tutorial based on scikit-learn. ☆35 · Nov 2, 2020 · Updated 5 years ago
- Dreamer 4 jax implementation ☆94 · Nov 28, 2025 · Updated 7 months ago