Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆116Jul 28, 2025Updated 7 months ago
Alternatives and similar repositories for hallucination-index
Users that are interested in hallucination-index are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 22, 2023Updated 2 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆12Jan 2, 2024Updated 2 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆24Sep 14, 2023Updated 2 years ago
- This repository is a combination of llama workflows and agents together which is a powerful concept.☆17Aug 9, 2024Updated last year
- AI tour planner agent using LlamaIndex Workflow☆49Jan 14, 2025Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- Drag & drop UI to build your customized LLM flow☆13Updated this week
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated last month
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- Explorium B2B Data - MCP Server☆20Mar 3, 2026Updated 3 weeks ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 4 years ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Dec 8, 2023Updated 2 years ago
- Which ML are you?☆13Jan 3, 2023Updated 3 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 3 months ago
- ☆12Aug 11, 2024Updated last year
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- STRIPS benchmarks for classical planning☆14Mar 29, 2022Updated 3 years ago
- 👩🏻🍳 A collection of example notebooks using Haystack☆526Mar 3, 2026Updated 3 weeks ago
- Eval LLMs☆11May 12, 2024Updated last year
- ☆25May 28, 2025Updated 9 months ago
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 2 weeks ago
- Easy to use! Claude API Streamlit Version! Build your project upon this!☆12Aug 2, 2023Updated 2 years ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR…☆17Feb 19, 2022Updated 4 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Dec 22, 2023Updated 2 years ago
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T…☆13Dec 22, 2023Updated 2 years ago
- Python library for implementing Responsible AI mitigations.☆69Dec 17, 2023Updated 2 years ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆26Feb 1, 2026Updated last month
- ☆15Sep 25, 2024Updated last year
- ☆13May 26, 2022Updated 3 years ago
- Custom::LexBot | AWS CloudFormation Custom Lambda Resource | Lex Bot☆10Jan 13, 2021Updated 5 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year