141forever/DiaHalu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/141forever/DiaHalu)

141forever / DiaHalu

This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP2024 findings)

☆18

Alternatives and similar repositories for DiaHalu

Users that are interested in DiaHalu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zfkarl / UniFER
View on GitHub
Official repository for the paper “Rethinking Facial Expression Recognition in the Era of Multimodal Large Language Models”
☆28Nov 5, 2025Updated 8 months ago
141forever / inductive-reasoning-papers
View on GitHub
The Paper Collection of Inductive Reasoning from 2015 to 2025
☆27Oct 21, 2025Updated 9 months ago
Chenglu0426 / FairGraphFL
View on GitHub
The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms'
☆18May 23, 2024Updated 2 years ago
zthang / Focus
View on GitHub
☆24Feb 3, 2024Updated 2 years ago
microsoft / ConstrainedReasoner
View on GitHub
☆13Aug 26, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Handon112358 / NeurIPS_2024_Learning-Matchable-Prior-For-Entity-Alignment-with-Unlabeled-Dangling-Cases
View on GitHub
open source code for NeurIPS 2024 paper
☆12Nov 9, 2025Updated 8 months ago
yhcc / utcie
View on GitHub
This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>
☆15Aug 10, 2023Updated 2 years ago
PostMindLab / ICD
View on GitHub
[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
☆18Nov 10, 2025Updated 8 months ago
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
tcapelle / mistral_wandb
View on GitHub
A full fledged mistral+wandb
☆13Aug 16, 2024Updated last year
Job-Bench / job-bench-eval
View on GitHub
Official eval scripts for JobBench
☆29Jul 18, 2026Updated last week
BunsenFeng / FactKB
View on GitHub
Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.
☆20Dec 25, 2023Updated 2 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
LiangThree / MCMA
View on GitHub
☆16Jan 12, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ivclab / Sound20
View on GitHub
Sound Classification Dataset
☆11Oct 18, 2018Updated 7 years ago
takomc / amp
View on GitHub
【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"
☆22Sep 26, 2024Updated last year
weixuan-wang123 / SADI
View on GitHub
☆19Sep 1, 2025Updated 10 months ago
longshengwang / tcpthrough-client
View on GitHub
该项目主要用来做 tcp 穿透内网（这是客户端）
☆16Oct 23, 2019Updated 6 years ago
AI21Labs / factor
View on GitHub
Code and data for the FACTOR paper
☆54Nov 15, 2023Updated 2 years ago
GuoqingWang1 / Awesome-dLLM-Papers
View on GitHub
☆20Mar 11, 2026Updated 4 months ago
FUTUREEEEEE / Dynamic-RAG
View on GitHub
AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
☆18Nov 9, 2024Updated last year
Sreyan88 / VDGD
View on GitHub
Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
☆25May 7, 2025Updated last year
cabreraalex / private-fair-GAN
View on GitHub
ICLR Reproducibility Challenge: Generative Adversarial Models For Learning Private And Fair Representations
☆12Jan 12, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
halfrot / ALaRM
View on GitHub
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
☆25Mar 28, 2024Updated 2 years ago
asappresearch / scale-score
View on GitHub
☆22Jan 5, 2024Updated 2 years ago
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
flageval-baai / HalluDial
View on GitHub
☆21Aug 19, 2024Updated last year
byronBBL / Context-DPO
View on GitHub
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆23Feb 17, 2025Updated last year
qpc1611094 / FPL
View on GitHub
Fuzzy Positive Learning (CVPR2023)
☆15Jul 25, 2024Updated 2 years ago
hhc1997 / MSCN
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
QinYang79 / CRCL
View on GitHub
Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)
☆15Jun 6, 2024Updated 2 years ago
chkwy / SSFO
View on GitHub
☆22Sep 28, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zfkarl / CellVerse
View on GitHub
CellVerse: Do Large Language Models Really Understand Cell Biology?
☆16May 14, 2025Updated last year
huiwy / reflection-on-trees
View on GitHub
☆14May 9, 2024Updated 2 years ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
jpwahle / emnlp23-paraphrase-types
View on GitHub
The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"
☆12Oct 20, 2024Updated last year
pspdada / SENTINEL
View on GitHub
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
☆31Jul 2, 2026Updated 3 weeks ago
yejipark-m / ConVis
View on GitHub
[AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…
☆25Sep 26, 2024Updated last year
RapidsAtHKUST / PolarRec
View on GitHub
Source code of 'PolarRec: Improving Radio Interferometric Data Reconstruction Using Polar Coordinates', CVPR'24 accepted. By Ruoqi Wang, …
☆28Nov 25, 2025Updated 8 months ago