rungalileo / hallucination-indexView external linksLinks
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆116Jul 28, 2025Updated 6 months ago
Alternatives and similar repositories for hallucination-index
Users that are interested in hallucination-index are comparing it to the libraries listed below
Sorting:
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆24Sep 14, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Drag & drop UI to build your customized LLM flow☆13Updated this week
- Examples of using Galileo for better ML data quality!!☆13Feb 5, 2026Updated last week
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated 2 weeks ago
- Variational autoencoder in Theano☆12Sep 14, 2017Updated 8 years ago
- ☆15Oct 22, 2023Updated 2 years ago
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- simple demo of using C# & System.Management.Automation.dll to run powershell code (b64 encoded) without powershell.exe☆14Mar 29, 2017Updated 8 years ago
- ☆17Aug 17, 2024Updated last year
- ☆12Jan 2, 2024Updated 2 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- 6th Place Solution for the Google - Isolated Sign Language Recognition Kaggle Competition☆13May 4, 2023Updated 2 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated 3 weeks ago
- A virtual MediaWiki development environment, built on Vagrant, VirtualBox, and Puppet.☆16Dec 1, 2016Updated 9 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Oct 19, 2023Updated 2 years ago
- ☆17Jul 5, 2022Updated 3 years ago
- Run Managed Assemblies with RunDll☆17Jul 2, 2018Updated 7 years ago
- ☆15Sep 25, 2024Updated last year
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- ☆20Jul 2, 2024Updated last year
- ☆21Sep 6, 2021Updated 4 years ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- ☆17Mar 24, 2023Updated 2 years ago
- AI tour planner agent using LlamaIndex Workflow☆49Jan 14, 2025Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆57Apr 30, 2024Updated last year
- Easy to extend initial access scenario to help with EDR testing on Linux and Mac☆26Mar 20, 2022Updated 3 years ago
- This repository contains code for 3rd place in the Feedback-Prize---English-Language-Learning which was hosted on kaggle☆19Dec 15, 2022Updated 3 years ago
- Pre-training character n-gram embeddings☆23Nov 1, 2023Updated 2 years ago
- 6th Position Solution Code for Kaggle - LLM Science Exam Competition☆24Jul 8, 2024Updated last year
- ☆34Dec 9, 2025Updated 2 months ago
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆31Jan 28, 2026Updated 3 weeks ago
- Cobalt Strike log state tracking, parsing, and storage☆24Jul 18, 2019Updated 6 years ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated 7 months ago