rungalileo / agent-leaderboard
Ranking LLMs on agentic tasks
☆138 Updated this week
Alternatives and similar repositories for agent-leaderboard
Users who are interested in agent-leaderboard are comparing it to the libraries listed below.
- Readymade evaluators for agent trajectories ☆230 Updated 2 weeks ago
- Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek ☆130 Updated last month
- ☆104 Updated 2 months ago
- A list of AI memory projects ☆108 Updated 4 months ago
- MCP (Model Context Protocol) server for Weaviate ☆128 Updated 2 weeks ago
- AI Engineering bootcamp ☆90 Updated 2 months ago
- ☆86 Updated 3 weeks ago
- ☆173 Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆132 Updated last month
- ☆187 Updated this week
- Agentic RAG to help you build a startup🚀 ☆44 Updated 2 months ago
- A bot with memory, built on LangGraph Cloud. ☆120 Updated 10 months ago
- Building LLM-Enabled Multi Agent Applications with AutoGen ☆147 Updated last week
- ☆66 Updated 10 months ago
- ☆99 Updated 8 months ago
- ☆122 Updated 3 months ago
- Testing and evaluation framework for voice agents ☆121 Updated this week
- An AI Clone For Any X Profile ☆82 Updated 5 months ago
- ☆92 Updated 2 months ago
- Dynamic Metadata based RAG Framework ☆75 Updated 10 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling ☆130 Updated 7 months ago
- ☆101 Updated 3 months ago
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google ☆62 Updated last month
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a… ☆86 Updated 11 months ago
- Multi-Agents using Workflows ☆51 Updated 5 months ago
- ☆61 Updated 2 months ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini… ☆200 Updated 8 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher ☆162 Updated 8 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆91 Updated 7 months ago
- This is the official companion repository for the book The Complete LangGraph Blueprint: Build 50+ AI Agents for Business Success. The re… ☆47 Updated 2 months ago