rungalileo / hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆106 Updated 5 months ago
Alternatives and similar repositories for hallucination-index:
Users who are interested in hallucination-index are comparing it to the libraries listed below.
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆48 Updated last year
- ☆76 Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 7 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆101 Updated 10 months ago
- Sample notebooks and prompts for LLM evaluation ☆120 Updated 2 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆106 Updated last week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆100 Updated 10 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆100 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc. on task… ☆147 Updated 4 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆93 Updated 2 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆145 Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆100 Updated 2 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 3 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆67 Updated 4 months ago
- Chunk your text using gpt4o-mini more accurately ☆43 Updated 6 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated 11 months ago
- ☆45 Updated 10 months ago
- ☆88 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆74 Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 6 months ago
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ☆166 Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆82 Updated last week
- This repository implements the chain of verification paper by Meta AI ☆163 Updated last year
- ☆58 Updated 10 months ago
- ☆70 Updated 4 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆100 Updated 5 months ago
- ☆115 Updated 3 weeks ago
- Data extraction with LLM on CPU ☆112 Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆118 Updated last year