Gen-Verse/GenEnv

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gen-Verse/GenEnv)

Gen-Verse / GenEnv

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

☆62

Alternatives and similar repositories for GenEnv

Users that are interested in GenEnv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / Simia-Agent-Training
View on GitHub
Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆65Feb 18, 2026Updated 5 months ago
RUC-NLPIR / EnvScaler
View on GitHub
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆176Feb 12, 2026Updated 5 months ago
jinzhuoran / RAG-RewardBench
View on GitHub
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18Dec 19, 2024Updated last year
shiweijiezero / R3L
View on GitHub
☆23Apr 5, 2026Updated 3 months ago
X1AOX1A / Word2World
View on GitHub
[ACL 2026 Oral] From Word to World: Can Large Language Models be Implicit Text-based World Models?
☆66Apr 13, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Trae1ounG / Pretrain_Space_RLVR
View on GitHub
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17Apr 16, 2026Updated 3 months ago
Snowflake-Labs / agent-world-model
View on GitHub
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆412May 28, 2026Updated last month
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
pUmpKin-Co / ComplementaryRL
View on GitHub
Co-evolving policy actors and experience extractors for efficient experience-driven agent RL
☆51May 12, 2026Updated 2 months ago
emrecanacikgoz / Tool-R0
View on GitHub
☆35Apr 3, 2026Updated 3 months ago
TheAgentArk / Toucan
View on GitHub
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
☆256Dec 16, 2025Updated 7 months ago
StefanHeng / ProgGen
View on GitHub
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17Mar 29, 2024Updated 2 years ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
View on GitHub
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆162Jun 8, 2026Updated last month
eigent-ai / toolathlon_gym
View on GitHub
Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.
☆139Apr 2, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
neulab / SWE-Playground
View on GitHub
Official Repository for "Training Versatile Coding Agents in Synthetic Environments"
☆22Jan 11, 2026Updated 6 months ago
lblankl / Short-RL
View on GitHub
Short RL
☆19Apr 16, 2026Updated 3 months ago
stellalisy / PrefPalette
View on GitHub
☆21Apr 3, 2026Updated 3 months ago
SteveKGYang / MetaAligner
View on GitHub
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24Sep 26, 2024Updated last year
Trae1ounG / BuPO
View on GitHub
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60Feb 6, 2026Updated 5 months ago
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
Gen-Verse / Open-AgentRL
View on GitHub
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆588Jun 12, 2026Updated last month
tengxiao1 / MR-Search
View on GitHub
Meta-Reinforcement Learning with Self-Reflection
☆33Mar 26, 2026Updated 3 months ago
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RyanLiu112 / GenPRM
View on GitHub
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102Nov 8, 2025Updated 8 months ago
FoundationAgents / AutoEnv
View on GitHub
Scaling Agentic Environments Automatically.
☆66Mar 26, 2026Updated 3 months ago
Zayne-sprague / To-CoT-or-not-to-CoT
View on GitHub
☆26Apr 10, 2025Updated last year
Gen-Verse / ReasonFlux
View on GitHub
[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.
☆540Sep 27, 2025Updated 9 months ago
UCSB-NLP-Chang / Skill-Usage
View on GitHub
☆46Apr 8, 2026Updated 3 months ago
kanishkg / endless-terminals
View on GitHub
☆134Mar 31, 2026Updated 3 months ago
Zhiyuan-Zeng / RLVE
View on GitHub
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆225Apr 30, 2026Updated 2 months ago
Tim-Siu / reft-exp
View on GitHub
A research repo for experiments about Reinforcement Finetuning
☆55Apr 7, 2025Updated last year
namezhenzhang / CM2-RLCR-Tool-Agent
View on GitHub
Official Implementation of Papar CM2
☆25Apr 21, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lukahhcm / Awesome_Environment_Scaling
View on GitHub
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆71Jan 28, 2026Updated 5 months ago
nick7nlp / FastCuRL
View on GitHub
FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)
☆61Oct 10, 2025Updated 9 months ago
czp16 / Bridge-LLM-reasoning
View on GitHub
Behavior Injection: Preparing Language Models for Reinforcement Learning (NeurIPS 2025)
☆17Jul 1, 2025Updated last year
PRIME-RL / Entropy-Mechanism-of-RL
View on GitHub
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆443Jul 11, 2025Updated last year
qhjqhj00 / MemoBrain
View on GitHub
Executive Memory for Coherent Long-Horizon Reasoning!
☆84Jan 14, 2026Updated 6 months ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61Jul 23, 2024Updated last year