rungalileo / agent-leaderboard
Ranking LLMs on agentic tasks
☆132Updated 3 weeks ago
Alternatives and similar repositories for agent-leaderboard
Users that are interested in agent-leaderboard are comparing it to the libraries listed below
Sorting:
- Readymade evaluators for agent trajectories☆195Updated 2 weeks ago
- ☆94Updated last month
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆162Updated 7 months ago
- MCP (Model Context Protocol) server for Weaviate☆124Updated 2 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆124Updated 3 weeks ago
- Build LangGraph agents with large numbers of tools☆279Updated last month
- Building LLM-Enabled Multi Agent Applications with AutoGen☆136Updated last week
- ☆153Updated this week
- 🧍♂️LLM as a manager for approval processes.☆187Updated last month
- ☆123Updated 5 months ago
- AI Engineering bootcamp☆87Updated 2 months ago
- An example of multi-agent orchestration with llama-index☆421Updated 3 months ago
- A practical RAG where you can download and chat with github repo☆79Updated 3 months ago
- Oliva Multi-Agent Assistant☆352Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆114Updated 2 months ago
- This is the official companion repository for the book The Complete LangGraph Blueprint: Build 50+ AI Agents for Business Success. The re…☆41Updated last month
- Testing and evaluation framework for voice agents☆117Updated 2 weeks ago
- Turn topics into essays in seconds!☆180Updated 3 weeks ago
- A list of AI memory projects☆102Updated 4 months ago
- A repository Payman + Langgraph integration examples that allow AI Agent to simply create tasks for Humans on Payman that pay them money …☆82Updated 7 months ago
- ☆83Updated last week
- Dynamic Metadata based RAG Framework☆75Updated 9 months ago
- SwarmZero's SDK for building AI agents, swarms of agents and much more.☆240Updated 3 months ago
- ☆94Updated 2 months ago
- ☆28Updated this week
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.☆247Updated last month
- ReActMCP is a reactive MCP server that empowers AI assistants to instantly respond with real-time, Markdown-formatted web search insights…☆135Updated last month
- CAMEL framework-based multi-agent system for task-driven and dynamic environments☆92Updated 11 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆103Updated last year
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a…☆84Updated 11 months ago