rungalileo / agent-leaderboardLinks
Ranking LLMs on agentic tasks
☆192Updated last month
Alternatives and similar repositories for agent-leaderboard
Users that are interested in agent-leaderboard are comparing it to the libraries listed below
Sorting:
- ☆232Updated 3 months ago
- Readymade evaluators for agent trajectories☆345Updated last month
- Tutorial for building LLM router☆228Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆135Updated last month
- An agent benchmark with tasks in a simulated software company.☆561Updated 3 weeks ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆136Updated 7 months ago
- Together Open Deep Research☆352Updated 5 months ago
- ☆167Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆455Updated last month
- Repository demonstrating best practices and patterns for implementing agentic workflows in Python, featuring modular, scalable, and reusa…☆170Updated 11 months ago
- ☆73Updated 11 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆168Updated last year
- Testing and evaluation framework for voice agents☆151Updated 4 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆114Updated last year
- ☆181Updated 7 months ago
- ☆146Updated last year
- ☆189Updated this week
- ☆95Updated 6 months ago
- This is the official repository for Auto-RAG.☆224Updated 2 months ago
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a…☆114Updated last year
- ☆88Updated 5 months ago
- ☆78Updated last week
- A bot with memory, built on LangGraph Cloud.☆135Updated last year
- A list of AI memory projects☆231Updated 9 months ago
- An example of multi-agent orchestration with llama-index☆431Updated 8 months ago
- A practical RAG where you can download and chat with github repo☆89Updated 8 months ago
- ☆78Updated 8 months ago
- Build datasets using natural language☆529Updated 3 weeks ago
- ☆209Updated 3 months ago
- ☆100Updated last year