stanford-futuredata / ARES
☆436 · Updated last month
Related projects:
- RAGChecker: A Fine-grained Framework For Diagnosing RAG ☆372 · Updated 2 weeks ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval ☆847 · Updated 2 weeks ago
- Corrective Retrieval Augmented Generation ☆272 · Updated 5 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆439 · Updated 2 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆745 · Updated last week
- Fine-Tuning Embedding for RAG with Synthetic Data ☆456 · Updated last year
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆790 · Updated last week
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆573 · Updated 9 months ago
- ☆772 · Updated 10 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT. ☆655 · Updated last week
- Generative Representational Instruction Tuning ☆525 · Updated 2 weeks ago
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin… ☆659 · Updated this week
- ☆418 · Updated 2 months ago
- Efficient Retrieval Augmentation and Generation Framework ☆1,255 · Updated last week
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024) ☆153 · Updated last month
- Official repository for ORPO ☆409 · Updated 3 months ago
- Automatically evaluate your LLMs in Google Colab ☆511 · Updated 4 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆465 · Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆1,396 · Updated this week
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆296 · Updated last week
- Best practices for distilling large language models. ☆370 · Updated 7 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆362 · Updated 7 months ago
- The code used to train and run inference with the ColPali architecture. ☆502 · Updated this week
- Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆481 · Updated last week
- ☆416 · Updated 2 months ago
- Easily embed, cluster and semantically label text datasets ☆433 · Updated 5 months ago
- Arena-Hard-Auto: An automatic LLM benchmark. ☆416 · Updated 2 weeks ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s… ☆390 · Updated 3 weeks ago
- A tool for evaluating LLMs ☆376 · Updated 4 months ago
- Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214) ☆316 · Updated this week