codezakh / DataEnvGymLinks

A testbed for agents and environments that can automatically improve models through data generation.

☆27

Alternatives and similar repositories for DataEnvGym

Users that are interested in DataEnvGym are comparing it to the libraries listed below

Sorting:

aszala / EnvGen
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆34Updated last year
katiekang1998 / reasoning_generalization
☆34Updated 6 months ago
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆52Updated 3 weeks ago
Shalev-Lifshitz / MultiAgentVerification
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
☆19Updated 5 months ago
LAMDASZ-ML / Self-Backtracking
☆47Updated 5 months ago
facebookresearch / dualformer
implementation of dualformer
☆18Updated 5 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆103Updated last week
sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆76Updated last month
amazon-science / PAE
☆60Updated 5 months ago
ablghtianyi / ICL_Modular_Arithmetic
☆19Updated 4 months ago
Agent-E3 / ExACT
☆20Updated 4 months ago
Berkeley-NLP / Agent-Eval-Refine
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆139Updated 8 months ago
shenao-zhang / SELM
The official implementation of Self-Exploring Language Models (SELM)
☆64Updated last year
likenneth / q_probe
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆41Updated last year
sunblaze-ucb / math_ood
☆39Updated last month
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆36Updated last year
MLE-Dojo / MLE-Dojo
☆61Updated last week
google-deepmind / bbeh
☆85Updated 2 months ago
Gabesarch / ICAL
☆47Updated 2 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 5 months ago
jwhj / OREO
☆114Updated 6 months ago
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆82Updated 2 months ago
Parallel-Reasoning / APR
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆117Updated 3 months ago
csinva / tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆39Updated last year
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆92Updated last week
mandyyyyii / east
☆20Updated 3 months ago
conglu1997 / intelligent-go-explore
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
☆61Updated 5 months ago
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆114Updated 4 months ago
Gen-Verse / CURE
Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆103Updated 2 weeks ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71Updated last year