ServiceNow / TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
☆302 Updated last month
Alternatives and similar repositories for TapeAgents
Users who are interested in TapeAgents are comparing it to the libraries listed below.
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re… ☆509 Updated 3 weeks ago
- An agent benchmark with tasks in a simulated software company. ☆635 Updated 2 months ago
- AWM: Agent Workflow Memory ☆389 Updated last month
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? ☆231 Updated this week
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆146 Updated 11 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆346 Updated last year
- ☆223 Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆321 Updated last year
- ☆236 Updated 3 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat… ☆427 Updated 2 weeks ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆602 Updated 5 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆583 Updated 6 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆328 Updated 3 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents ☆219 Updated last year
- Tutorial for building LLM router ☆244 Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆473 Updated last year
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆249 Updated 8 months ago
- ☆137 Updated 10 months ago
- ☆331 Updated 6 months ago
- End-to-end Generative Optimization for AI Agents ☆707 Updated 2 months ago
- Automatic evals for LLMs ☆579 Updated last month
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans. ☆120 Updated 2 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆196 Updated 5 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆120 Updated 3 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆161 Updated 9 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆277 Updated last year
- A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ ☆137 Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? ☆136 Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆153 Updated last year
- ☆641 Updated 3 months ago