sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆32 Updated last year
Alternatives and similar repositories for llm-rag-eval
Users who are interested in llm-rag-eval are comparing it to the libraries listed below.
- Compare how Agent systems perform on several benchmarks. ☆103 Updated 5 months ago
- LLM reads a paper and produces a working prototype ☆60 Updated 9 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆126 Updated 11 months ago
- ☆82 Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆114 Updated 9 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆126 Updated 2 months ago
- ☆147 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- Official Repo for CRMArena and CRMArena-Pro ☆127 Updated 2 months ago
- ☆39 Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆151 Updated last year
- ☆63 Updated last year
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆50 Updated 2 years ago
- ☆23 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆81 Updated last year
- ARAGOG - Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆113 Updated last year
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- ☆61 Updated 6 months ago
- Explore the use of DSPy for extracting features from PDFs ☆51 Updated last year
- DSPy in action with open-source LLMs. ☆102 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆236 Updated 3 months ago
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex. ☆101 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆148 Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆96 Updated 3 months ago
- Deep Research through Multi-Agents, using GraphRAG ☆83 Updated 4 months ago
- A deep-dive into HyDE for Advanced LLM RAG + Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆33 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 11 months ago
- This repository implements the chain of verification paper by Meta AI ☆188 Updated 2 years ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ☆118 Updated 11 months ago
- Function Calling Benchmark & Testing ☆92 Updated last year