vectara / awesome-agent-failures
A community-curated collection of AI agent failure modes and battle-tested solutions.
☆108 Updated 3 weeks ago
Alternatives and similar repositories for awesome-agent-failures
Users interested in awesome-agent-failures are comparing it to the repositories listed below:
- An Automatic Prompt Optimization Framework for Large Language Models ☆130 Updated 2 months ago
- Open source RAG evaluation package ☆315 Updated 2 weeks ago
- A list of AI memory projects ☆234 Updated 9 months ago
- Ranking LLMs on agentic tasks ☆194 Updated last month
- Terminal-based AI coding agent, similar to Claude Code, OpenAI Codex, etc., but works with many more LLMs, e.g. Gemini, Groq, DeepSeek ☆148 Updated 5 months ago
- Deep Research for your internal data ☆341 Updated 4 months ago
- Context Engineering Course with DSPy ☆195 Updated 2 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems. ☆131 Updated last week
- Dynamic Metadata based RAG Framework ☆75 Updated last year
- Repository demonstrating best practices and patterns for implementing agentic workflows in Python, featuring modular, scalable, and reusa… ☆172 Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆136 Updated 2 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required ☆53 Updated 11 months ago
- Enterprise-grade memory framework for LLMs featuring GPU-optimized inference, vector storage, and automated scaling. Enables hyper-person… ☆88 Updated 5 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆120 Updated 8 months ago
- ☆234 Updated 7 months ago
- Vibe-coding tools for the LlamaIndex ecosystem ☆164 Updated this week
- Frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆190 Updated 2 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆90 Updated 3 weeks ago
- Salesforce Enterprise Deep Research ☆147 Updated this week
- Tutorial for building LLM router ☆231 Updated last year
- ☆89 Updated 5 months ago
- ☆96 Updated 7 months ago
- Research assistant for performing online research on a given topic, using LlamaIndex Workflows and Tavily API. Inspired by GPT-Researcher ☆168 Updated last year
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆76 Updated 6 months ago
- ☆181 Updated 8 months ago
- ☆113 Updated this week
- Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Ach… ☆233 Updated this week
- Learn to build and customize multi-agent systems using AutoGen. The course teaches you to implement complex AI applications through a… ☆117 Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation ☆105 Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆110 Updated 6 months ago