Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
☆447 · Jan 23, 2026 · Updated last month
Alternatives and similar repositories for meta-agents-research-environments
Users who are interested in meta-agents-research-environments compare it to the libraries listed below.
- ☆24 · Oct 9, 2025 · Updated 5 months ago
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning ☆75 · Jan 16, 2026 · Updated last month
- ☆28 · Jun 5, 2025 · Updated 9 months ago
- CS194-196 Course Project ☆14 · Feb 20, 2025 · Updated last year
- ☆55 · Aug 5, 2025 · Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementation ☆33 · Jul 11, 2025 · Updated 7 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆37 · Oct 7, 2025 · Updated 5 months ago
- The official implementation of Preference Data Reward-Augmentation. ☆18 · May 1, 2025 · Updated 10 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆39 · Dec 2, 2025 · Updated 3 months ago
- Bayes-Adaptive RL for LLM Reasoning ☆45 · May 28, 2025 · Updated 9 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,656 · Updated this week
- A Gym for Agentic LLMs ☆455 · Jan 21, 2026 · Updated last month
- LIMI: Less is More for Agency ☆159 · Oct 14, 2025 · Updated 4 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… ☆549 · Mar 3, 2026 · Updated last week
- ☆21 · May 3, 2025 · Updated 10 months ago
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering ☆33 · Jun 20, 2025 · Updated 8 months ago
- Pipeline parallelism for the minimalist ☆41 · Aug 6, 2025 · Updated 7 months ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge ☆102 · Feb 28, 2026 · Updated last week
- ☆11 · Oct 25, 2024 · Updated last year
- Simple repository for training small reasoning models ☆49 · Feb 17, 2026 · Updated 2 weeks ago
- Academic page for LimSim++ ☆11 · Mar 19, 2024 · Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs ☆19,739 · Updated this week
- Harness for running and evaluating AI agents against RL environments ☆120 · Mar 1, 2026 · Updated last week
- Async RL Training at Scale ☆1,107 · Updated this week
- Code for the paper 🌳 Tree Search for Language Model Agents ☆220 · Jul 25, 2024 · Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆186 · May 25, 2025 · Updated 9 months ago
- τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment ☆800 · Feb 11, 2026 · Updated 3 weeks ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆69 · Nov 14, 2024 · Updated last year
- AllenAI's post-training codebase ☆3,614 · Updated this week
- An agent benchmark with tasks in a simulated software company. ☆648 · Nov 17, 2025 · Updated 3 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆585 · Aug 10, 2025 · Updated 6 months ago
- DCPO: Dynamic Adaptive Clipping for RL ☆47 · Dec 20, 2025 · Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs ☆47 · Sep 23, 2025 · Updated 5 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi… ☆737 · Sep 11, 2025 · Updated 5 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆693 · Aug 5, 2025 · Updated 7 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection ☆10 · Mar 20, 2014 · Updated 11 years ago
- ☆23 · Jul 11, 2025 · Updated 7 months ago
- ☆14 · Dec 18, 2024 · Updated last year
- ☆15 · May 14, 2025 · Updated 9 months ago