facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
☆436 · Jan 23, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for meta-agents-research-environments
Users who are interested in meta-agents-research-environments compare it to the libraries listed below.
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning · ☆75 · Jan 16, 2026 · Updated last month
- ☆26 · Jun 5, 2025 · Updated 8 months ago
- CS194-196 Course Project · ☆14 · Feb 20, 2025 · Updated 11 months ago
- ☆54 · Aug 5, 2025 · Updated 6 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents · ☆37 · Oct 7, 2025 · Updated 4 months ago
- The original Shared Recurrent Memory Transformer implementation · ☆33 · Jul 11, 2025 · Updated 7 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository · ☆39 · Dec 2, 2025 · Updated 2 months ago
- The official implementation of Preference Data Reward-Augmentation. · ☆18 · May 1, 2025 · Updated 9 months ago
- A Gym for Agentic LLMs · ☆446 · Jan 21, 2026 · Updated 3 weeks ago
- SkyRL: A Modular Full-stack RL Library for LLMs · ☆1,571 · Updated this week
- Bayes-Adaptive RL for LLM Reasoning · ☆45 · May 28, 2025 · Updated 8 months ago
- LIMI: Less is More for Agency · ☆160 · Oct 14, 2025 · Updated 4 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… · ☆525 · Updated this week
- ☆21 · May 3, 2025 · Updated 9 months ago
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering · ☆32 · Jun 20, 2025 · Updated 7 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions · ☆25 · Aug 8, 2024 · Updated last year
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge · ☆100 · Updated this week
- Simple repository for training small reasoning models · ☆49 · Feb 6, 2025 · Updated last year
- ☆11 · Oct 25, 2024 · Updated last year
- τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment · ☆736 · Updated this week
- Async RL Training at Scale · ☆1,071 · Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs · ☆19,132 · Updated this week
- Code for the paper 🌳 Tree Search for Language Model Agents · ☆219 · Jul 25, 2024 · Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" · ☆186 · May 25, 2025 · Updated 8 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" · ☆69 · Nov 14, 2024 · Updated last year
- AllenAI's post-training codebase · ☆3,573 · Updated this week
- An agent benchmark with tasks in a simulated software company. · ☆639 · Nov 17, 2025 · Updated 2 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents · ☆585 · Aug 10, 2025 · Updated 6 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi… · ☆723 · Sep 11, 2025 · Updated 5 months ago
- DCPO: Dynamic Adaptive Clipping for RL · ☆45 · Dec 20, 2025 · Updated last month
- Resa: Transparent Reasoning Models via SAEs · ☆47 · Sep 23, 2025 · Updated 4 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning · ☆689 · Aug 5, 2025 · Updated 6 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models · ☆17 · Nov 4, 2025 · Updated 3 months ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills · ☆12 · Jul 15, 2023 · Updated 2 years ago
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way. · ☆14 · Aug 20, 2025 · Updated 5 months ago
- Learning to Skip the Middle Layers of Transformers · ☆17 · Aug 7, 2025 · Updated 6 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows · ☆19 · Nov 4, 2025 · Updated 3 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection · ☆10 · Mar 20, 2014 · Updated 11 years ago
- ☆23 · Jul 29, 2025 · Updated 6 months ago