Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆116Jul 28, 2025Updated 11 months ago
Alternatives and similar repositories for hallucination-index
Users that are interested in hallucination-index are comparing it to the libraries listed below.
- ☆16Oct 22, 2023Updated 2 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A language-model–powered compressor for natural language text☆52Oct 23, 2025Updated 8 months ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Sep 22, 2021Updated 4 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Variational autoencoder in Theano☆11Sep 14, 2017Updated 8 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- This repository combines llama workflows and agents, which is a powerful concept.☆17Aug 9, 2024Updated last year
- ☆30Oct 7, 2024Updated last year
- opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 5 years ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- ☆17Jul 5, 2022Updated 3 years ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆19Apr 4, 2026Updated 3 months ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- AI Software Bill of Materials for EU AI Act☆11Jan 18, 2024Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Oct 19, 2023Updated 2 years ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆47Aug 1, 2023Updated 2 years ago
- ☆12Apr 16, 2026Updated 2 months ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 6 months ago
- Code for Fooling Contrastive Language-Image Pre-trained Models with CLIPMasterPrints☆15Jan 25, 2026Updated 5 months ago
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated 3 months ago
- Meniscus - The Python Event Logging Service☆63May 17, 2015Updated 11 years ago
- STRIPS benchmarks for classical planning☆15Mar 29, 2022Updated 4 years ago
- A function to do all☆35Apr 16, 2024Updated 2 years ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 9 months ago
- Eval LLMs☆11May 12, 2024Updated 2 years ago
- ☆27May 28, 2025Updated last year
- ☆11May 12, 2023Updated 3 years ago
- Python Implementation of a Bengali OCR☆10Jul 20, 2019Updated 6 years ago
- R package wrapper for Istanbul Municipality Open Data Portal☆10Apr 24, 2021Updated 5 years ago
- Adapted OS for e-ink tablets - lets you use work-related apps without harming your eyes☆11May 17, 2020Updated 6 years ago
- Content for the supergraph.io website☆17Jun 18, 2024Updated 2 years ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago