lukahhcm/Awesome_Environment_Scaling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lukahhcm/Awesome_Environment_Scaling)

lukahhcm / Awesome_Environment_Scaling

Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.

☆72

Alternatives and similar repositories for Awesome_Environment_Scaling

Users that are interested in Awesome_Environment_Scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YangHaolin0526 / MARS-SQL
View on GitHub
☆43Dec 19, 2025Updated 7 months ago
VovyH / FreeKnowledge_AI
View on GitHub
[2025AIAgent / 2025InternLab]An agent that provides free and flexible access to Search external knowledge.
☆23Feb 18, 2026Updated 5 months ago
hkust-nlp / deepsearch-tts
View on GitHub
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21Oct 8, 2025Updated 9 months ago
jinzhuoran / RAG-RewardBench
View on GitHub
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18Dec 19, 2024Updated last year
DeepExperience / agent2world
View on GitHub
🪐 Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback
☆23Jan 29, 2026Updated 6 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
RUC-NLPIR / EnvScaler
View on GitHub
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆179Feb 12, 2026Updated 5 months ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
hemingkx / Whisper
View on GitHub
[ACL 2026] Enabling Efficient Reasoning in LLMs via Black-box Persuasive Prompting
☆22Jan 9, 2026Updated 6 months ago
JiayuJeff / CostBench
View on GitHub
The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…
☆34Jun 14, 2026Updated last month
FoundationAgents / AutoEnv
View on GitHub
Scaling Agentic Environments Automatically.
☆66Mar 26, 2026Updated 4 months ago
vlm2-bench / VLM2-Bench
View on GitHub
VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
☆45May 20, 2025Updated last year
TheAgentArk / Toucan
View on GitHub
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
☆260Dec 16, 2025Updated 7 months ago
axon-rl / gem
View on GitHub
A Gym for Agentic LLMs
☆503Jan 21, 2026Updated 6 months ago
WxxShirley / Agent-STAR
View on GitHub
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
☆32May 12, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sheep333c / DIVE
View on GitHub
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
☆28Mar 13, 2026Updated 4 months ago
wwh0411 / MCP-Flow
View on GitHub
[ACL 2026 Main] MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools.
☆25Apr 8, 2026Updated 3 months ago
plageon / HierSearch
View on GitHub
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches
☆41Oct 9, 2025Updated 9 months ago
RUC-NLPIR / ET-Agent
View on GitHub
☆20Jan 18, 2026Updated 6 months ago
YuyaoZhangQAQ / QCompiler
View on GitHub
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆17Oct 20, 2025Updated 9 months ago
eigent-ai / toolathlon_gym
View on GitHub
Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.
☆140Jul 22, 2026Updated last week
MDI-Benchmark / MDI-Benchmark
View on GitHub
☆14Dec 18, 2024Updated last year
DeepExperience / REAL
View on GitHub
Rewards as Labels: Revisiting RLVR from a Classification Perspective
☆24Jun 26, 2026Updated last month
LuckyTiger123 / GPF
View on GitHub
The code Implementation of the paper “Universal Prompt Tuning for Graph Neural Networks”.
☆26Oct 16, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
RUC-NLPIR / GISA
View on GitHub
GISA: A Benchmark for General Information-Seeking Assistant
☆36Mar 20, 2026Updated 4 months ago
HKUST-KnowComp / LLM-Multistep-Jailbreak
View on GitHub
Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT
☆37Oct 15, 2023Updated 2 years ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
Evanwu1125 / LiteCoT
View on GitHub
☆17Jun 10, 2025Updated last year
hkust-nlp / Toolathlon
View on GitHub
[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
☆441Updated this week
weiyifan1023 / AutoTIR
View on GitHub
Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"
☆54Sep 4, 2025Updated 10 months ago
hkust-nlp / AgentVista
View on GitHub
Benchmarking multimodal agents on realistic, ultra-challenging visual scenarios requiring long-horizon hybrid tool use.
☆68Mar 10, 2026Updated 4 months ago
Zhitao-He / AgentsCourt
View on GitHub
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)
☆18Dec 30, 2024Updated last year
qhjqhj00 / awesome-agentic-search
View on GitHub
🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…
☆60Aug 28, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / Simia-Agent-Training
View on GitHub
Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆65Feb 18, 2026Updated 5 months ago
jinzhuoran / MiNer
View on GitHub
A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022
☆11Feb 1, 2023Updated 3 years ago
Fu-Dayuan / AgentRefine
View on GitHub
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆20Nov 22, 2025Updated 8 months ago
EIT-NLP / Distilling-CoT-Reasoning
View on GitHub
[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆22Feb 26, 2025Updated last year
ZhuYun97 / ENGINE
View on GitHub
Official implementation of paper "Efficient Tuning and Inference for Large Language Models on Textual Graphs"
☆38Jun 24, 2024Updated 2 years ago
WangHanLinHenry / SPA-RL-Agent
View on GitHub
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆89Sep 13, 2025Updated 10 months ago
G-JWLee / TAMP
View on GitHub
☆12May 15, 2025Updated last year