Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆116Jul 28, 2025Updated 10 months ago
Alternatives and similar repositories for hallucination-index
Users that are interested in hallucination-index are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples of using Galileo for better ML data quality!!☆13Feb 5, 2026Updated 4 months ago
- ☆16Oct 22, 2023Updated 2 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17May 14, 2026Updated last month
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆26Nov 21, 2022Updated 3 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- Smart Python OpenAI Load Balancer using priority endpoints and request retries. | Python package at link below:☆12Oct 18, 2024Updated last year
- This repository is a combination of llama workflows and agents together which is a powerful concept.☆17Aug 9, 2024Updated last year
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆24Sep 14, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 5 years ago
- Free and Open Platform for AI-assisted Computing☆10May 19, 2019Updated 7 years ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆19Apr 4, 2026Updated 2 months ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Oct 19, 2023Updated 2 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)☆11May 2, 2019Updated 7 years ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆47Aug 1, 2023Updated 2 years ago
- Which ML are you?☆13Jan 3, 2023Updated 3 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 5 months ago
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated 2 months ago
- Meniscus - The Python Event Logging Service☆63May 17, 2015Updated 11 years ago
- STRIPS benchmarks for classical planning☆15Mar 29, 2022Updated 4 years ago
- ☆10Aug 31, 2023Updated 2 years ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Token-level Reference-free Hallucination Detection☆98Jul 25, 2023Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 5 months ago
- This repo is containing notes and implementations for cherry-picked publications of my particular interest☆12May 14, 2020Updated 6 years ago
- ☆27May 28, 2025Updated last year
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- Run Managed Assemblies with RunDll☆17Jul 2, 2018Updated 7 years ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆346Dec 21, 2024Updated last year