ServiceNow / TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
☆279Updated this week
Alternatives and similar repositories for TapeAgents
Users who are interested in TapeAgents are comparing it to the libraries listed below.
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆348Updated this week
- AWM: Agent Workflow Memory☆275Updated 4 months ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆190Updated last week
- 🌎💪 BrowserGym, a Gym environment for web task automation☆776Updated last week
- Code for the paper 🌳 Tree Search for Language Model Agents☆201Updated 11 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- Code and Data for Tau-Bench☆609Updated 5 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆465Updated last week
- ☆211Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆173Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆215Updated last month
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents☆519Updated this week
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆219Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆123Updated 4 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆184Updated this week
- An agent benchmark with tasks in a simulated software company.☆397Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆486Updated last month
- ☆127Updated 3 months ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆422Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆551Updated 3 months ago
- Automatic evals for LLMs☆437Updated 2 weeks ago
- ⚖️ The First Coding Agent-as-a-Judge☆558Updated last month
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆206Updated this week
- ☆207Updated 4 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆84Updated 2 months ago
- End-to-end Generative Optimization for AI Agents☆605Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆760Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information.☆240Updated 8 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆125Updated 9 months ago
- Scale your LLM-as-a-judge.☆240Updated 2 weeks ago