rungalileo / agent-leaderboard
Ranking LLMs on agentic tasks
☆148 Updated 2 weeks ago
Alternatives and similar repositories for agent-leaderboard
Users who are interested in agent-leaderboard are comparing it to the repositories listed below.
- ☆215 Updated 2 weeks ago
- Readymade evaluators for agent trajectories ☆267 Updated last month
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling ☆130 Updated 8 months ago
- ☆160 Updated this week
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆132 Updated last month
- A list of AI memory projects ☆174 Updated 6 months ago
- Repository demonstrating best practices and patterns for implementing agentic workflows in Python, featuring modular, scalable, and reusa… ☆145 Updated 8 months ago
- ☆145 Updated 11 months ago
- ☆179 Updated 5 months ago
- Tutorial for building LLM router ☆217 Updated last year
- MCP (Model Context Protocol) server for Weaviate ☆136 Updated last month
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆107 Updated last year
- Learn to build and customize multi-agent systems using AutoGen. The course teaches you to implement complex AI applications through a… ☆93 Updated last year
- ☆122 Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆108 Updated 3 months ago
- ☆76 Updated 6 months ago
- ☆94 Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆173 Updated 9 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher ☆163 Updated 9 months ago
- Testing and evaluation framework for voice agents ☆128 Updated last month
- Together Open Deep Research ☆320 Updated 3 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- Dynamic Metadata based RAG Framework ☆75 Updated 11 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆129 Updated 5 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated last year
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex. ☆95 Updated 7 months ago
- Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024) ☆129 Updated 4 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 8 months ago
- ☆71 Updated 4 months ago
- An agentic AI application that lets you chat with your papers and also gather information from papers on arXiv and PubMed ☆142 Updated 2 months ago