rungalileo / agent-leaderboard
Ranking LLMs on agentic tasks
☆113Updated this week
Alternatives and similar repositories for agent-leaderboard:
Users that are interested in agent-leaderboard are comparing it to the libraries listed below
- Readymade evaluators for agent trajectories☆169Updated 3 weeks ago
- ☆85Updated 3 weeks ago
- Dynamic Metadata based RAG Framework☆72Updated 8 months ago
- Build LangGraph agents with large numbers of tools☆257Updated last month
- MCP (Model Context Protocol) server for Weaviate☆100Updated last month
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆161Updated 7 months ago
- ☆98Updated last month
- AI Engineering bootcamp☆88Updated last month
- ☆90Updated last month
- ☆115Updated 4 months ago
- this project will bootstrap and scaffold the projects for specific semantic search and RAG applications along with regular boiler plate c…☆89Updated 4 months ago
- ☆64Updated 2 months ago
- This is a proof of concept repo on how to create a gradio UI using the Model Context Protocol Client Python SDK.☆58Updated 4 months ago
- ☆121Updated last month
- Multi-Agents using Workflows☆48Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆122Updated this week
- ☆175Updated last month
- A bot with memory, built on LangGraph Cloud.☆114Updated 9 months ago
- Building LLM-Enabled Multi Agent Applications with AutoGen☆118Updated 2 weeks ago
- ☆121Updated last week
- ☆133Updated last week
- ☆41Updated last month
- A practical RAG where you can download and chat with github repo☆72Updated 2 months ago
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆69Updated 6 months ago
- ☆165Updated 2 months ago
- ☆50Updated last month
- ☆81Updated 7 months ago
- This open-source project & guide shows you exactly how to implement Canvas UX pattern + LangGraph human-in-the-loop workflows in your AI …☆71Updated last month
- A list of AI memory projects☆96Updated 3 months ago
- Readymade evaluators for your LLM apps☆313Updated this week