stanford-futuredata/ARES

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stanford-futuredata/ARES)

stanford-futuredata / ARES

Automated Evaluation of RAG Systems

☆730

Alternatives and similar repositories for ARES

Users that are interested in ARES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vibrantlabsai / ragas
View on GitHub
Supercharge Your LLM Application Evaluations 🚀
☆15,016Feb 24, 2026Updated 5 months ago
chen700564 / RGB
View on GitHub
☆371May 17, 2024Updated 2 years ago
truera / trulens
View on GitHub
Evaluation and Tracking for LLM Experiments and AI Agents
☆3,469Updated this week
amazon-science / RAGChecker
View on GitHub
RAGChecker: A Fine-grained Framework For Diagnosing RAG
☆1,102Dec 13, 2024Updated last year
IAAR-Shanghai / CRUD_RAG
View on GitHub
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
☆400May 20, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
gomate-community / rageval
View on GitHub
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
☆171Nov 18, 2024Updated last year
AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,944May 17, 2025Updated last year
AkariAsai / self-rag
View on GitHub
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,413May 25, 2024Updated 2 years ago
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,434Updated this week
confident-ai / deepeval
View on GitHub
The LLM Evaluation Framework
☆17,232Updated this week
YHPeter / Awesome-RAG-Evaluation
View on GitHub
The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.
☆201Apr 25, 2025Updated last year
beir-cellar / beir
View on GitHub
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,255Oct 16, 2025Updated 9 months ago
OpenBMB / RAGEval
View on GitHub
☆237Apr 2, 2025Updated last year
Marker-Inc-Korea / AutoRAG
View on GitHub
AutoRAG: Now your agent can find anything in your computer. It gets smarter if you are using it frequently.
☆4,957Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆523Oct 9, 2024Updated last year
stanford-futuredata / ColBERT
View on GitHub
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,904Oct 14, 2025Updated 9 months ago
TonicAI / tonic_validate
View on GitHub
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
☆327Jul 10, 2025Updated last year
embeddings-benchmark / mteb
View on GitHub
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,372Updated this week
microsoft / graphrag
View on GitHub
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆34,956Updated this week
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,625Dec 20, 2025Updated 7 months ago
Unstructured-IO / unstructured
View on GitHub
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆15,210Updated this week
jzbjyb / FLARE
View on GitHub
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆669Nov 20, 2023Updated 2 years ago
microsoft / LLMLingua
View on GitHub
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆6,498Apr 8, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆780Apr 7, 2023Updated 3 years ago
EleutherAI / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of language models.
☆13,443Jul 13, 2026Updated 2 weeks ago
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆15,419Updated this week
Arize-ai / phoenix
View on GitHub
AI Observability & Evaluation
☆10,781Updated this week
RUC-NLPIR / FlashRAG
View on GitHub
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
☆3,535Jul 19, 2026Updated last week
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,990Apr 22, 2026Updated 3 months ago
guidance-ai / guidance
View on GitHub
A guidance language for controlling large language models.
☆21,694May 21, 2026Updated 2 months ago
TIGER-AI-Lab / StructLM
View on GitHub
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆76Oct 19, 2024Updated last year
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆51,165Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
KarelDO / xmc.dspy
View on GitHub
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆457Feb 13, 2024Updated 2 years ago
hymie122 / RAG-Survey
View on GitHub
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-…
☆1,788Aug 20, 2024Updated last year
jxzhangjhu / Awesome-LLM-RAG
View on GitHub
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
☆1,340Jul 22, 2026Updated last week
guardrails-ai / guardrails
View on GitHub
Adding guardrails to large language models.
☆7,217Updated this week
aurelio-labs / semantic-router
View on GitHub
Superfast AI decision making and intelligent processing of multi-modal data.
☆3,752Updated this week
IntelLabs / fastRAG
View on GitHub
Efficient Retrieval Augmentation and Generation Framework
☆1,785Jan 12, 2026Updated 6 months ago
facebookresearch / CRAG
View on GitHub
Comprehensive benchmark for RAG
☆297Jun 14, 2025Updated last year