This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
☆36Nov 9, 2025Updated 3 months ago
Alternatives and similar repositories for DreamGym
Users that are interested in DreamGym are comparing it to the libraries listed below
Sorting:
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆18Apr 1, 2025Updated 11 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated last month
- ☆13Aug 12, 2022Updated 3 years ago
- ☆11Jun 18, 2023Updated 2 years ago
- Portfolio with data science and machine learning projects I developed during my training in data science.☆10Jan 4, 2021Updated 5 years ago
- ☆22Dec 11, 2025Updated 2 months ago
- Introduction to Machine Learning using scikit-learn and PyTorch☆10Sep 26, 2019Updated 6 years ago
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑…☆16Jun 10, 2025Updated 8 months ago
- ☆10Aug 31, 2021Updated 4 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated 11 months ago
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- ☆13Sep 26, 2024Updated last year
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- ☆11Jan 21, 2024Updated 2 years ago
- In the process of my codeing the learning and summary☆12Mar 6, 2019Updated 6 years ago
- UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs☆11Apr 13, 2023Updated 2 years ago
- A list of all papers related to anomaly detection in NeurIPS 2020.☆10Jan 13, 2021Updated 5 years ago
- Flexible and transparent Python Boruta implementation☆15Jun 8, 2025Updated 8 months ago
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning☆22Nov 16, 2024Updated last year
- ☆10Mar 6, 2022Updated 3 years ago
- 人工智能基础(高中版) 非官方代码☆13May 25, 2021Updated 4 years ago
- ☆12Jul 25, 2023Updated 2 years ago
- ☆26Oct 16, 2025Updated 4 months ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- resources, links for OCR & greek☆10Mar 8, 2021Updated 4 years ago
- Implementation of Stochastic Gradient Descent algorithms in Python (cite https://doi.org/10.1007/s00158-020-02599-z)☆11May 19, 2021Updated 4 years ago
- ☆104Dec 5, 2025Updated 2 months ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- ☆17Dec 22, 2025Updated 2 months ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 4 years ago
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆18Apr 22, 2025Updated 10 months ago
- ☆12Oct 28, 2022Updated 3 years ago
- 常用算法和数据结构Python实现☆10Oct 21, 2016Updated 9 years ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 9 months ago
- ☆12Jul 21, 2025Updated 7 months ago
- ☆11Jun 2, 2019Updated 6 years ago