Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
☆483 · May 4, 2026 · Updated this week
Alternatives and similar repositories for meta-agents-research-environments
Users that are interested in meta-agents-research-environments are comparing it to the libraries listed below.
- ☆26 · Oct 9, 2025 · Updated 7 months ago
- ☆60 · Aug 5, 2025 · Updated 9 months ago
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering · ☆34 · Jun 20, 2025 · Updated 10 months ago
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents · ☆41 · Apr 17, 2026 · Updated 3 weeks ago
- Academic page for LimSim++ · ☆11 · Mar 19, 2024 · Updated 2 years ago
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning · ☆82 · Jan 16, 2026 · Updated 3 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository · ☆39 · Dec 2, 2025 · Updated 5 months ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge · ☆110 · Feb 28, 2026 · Updated 2 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi… · ☆773 · Sep 11, 2025 · Updated 7 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) · ☆19 · Feb 11, 2025 · Updated last year
- LIMI: Less is More for Agency · ☆161 · Oct 14, 2025 · Updated 6 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs · ☆1,806 · May 3, 2026 · Updated last week
- Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models · ☆14 · Sep 3, 2025 · Updated 8 months ago
- ☆24 · Mar 1, 2025 · Updated last year
- Lean evaluation and metaprogramming utilities for provers. · ☆91 · Apr 29, 2026 · Updated last week
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model · ☆13 · Dec 29, 2024 · Updated last year
- ☆11 · Oct 25, 2024 · Updated last year
- A Gym for Agentic LLMs · ☆481 · Jan 21, 2026 · Updated 3 months ago
- Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms. · ☆21 · Feb 13, 2023 · Updated 3 years ago
- AllenAI's post-training codebase · ☆3,708 · May 3, 2026 · Updated last week
- Bayes-Adaptive RL for LLM Reasoning · ☆46 · May 28, 2025 · Updated 11 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework · ☆21,046 · Apr 30, 2026 · Updated last week
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions · ☆25 · Aug 8, 2024 · Updated last year
- Our library for RL environments + evals · ☆4,077 · Updated this week
- The original Shared Recurrent Memory Transformer implementation · ☆36 · Jul 11, 2025 · Updated 9 months ago
- An enterprise deep research benchmark · ☆35 · Apr 22, 2026 · Updated 2 weeks ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… · ☆619 · Updated this week
- Simple repository for training small reasoning models · ☆50 · Feb 17, 2026 · Updated 2 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) · ☆34 · Mar 7, 2025 · Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression · ☆154 · Apr 7, 2026 · Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents