facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
☆418Updated last week
Alternatives and similar repositories for meta-agents-research-environments
Users who are interested in meta-agents-research-environments are comparing it to the libraries listed below.
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆255Updated 8 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆344Updated 3 weeks ago
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆362Updated 2 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆532Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 10 months ago
- A Gym for Agentic LLMs☆437Updated last week
- ☆216Updated last week
- AWM: Agent Workflow Memory☆387Updated last month
- [ICLR 2026] Learning to Reason without External Rewards☆388Updated this week
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆120Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆624Updated 6 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆229Updated 6 months ago
- ☆328Updated 6 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆218Updated last year
- ☆227Updated 11 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆166Updated 3 months ago
- ☆321Updated 4 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆570Updated 4 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Updated last month
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆243Updated 8 months ago
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆209Updated 3 months ago
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆488Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆621Updated 3 weeks ago
- Reproducible, flexible LLM evaluations☆331Updated this week
- ☆117Updated last year
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆273Updated 3 months ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆504Updated last week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆357Updated this week
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆389Updated last year
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆283Updated 4 months ago