This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP 2024 Findings)
☆18 · Apr 5, 2025 · Updated last year
Alternatives and similar repositories for DiaHalu
Users that are interested in DiaHalu are comparing it to the libraries listed below.
- This is the repository for the paper ‘A Survey of Inductive Reasoning for Large Language Models’ (ACL2026) ☆46 · Apr 8, 2026 · Updated 2 months ago
- The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms' ☆18 · May 23, 2024 · Updated 2 years ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆28 · Apr 9, 2024 · Updated 2 years ago
- ☆19 · Mar 11, 2026 · Updated 3 months ago
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service ☆18 · Sep 18, 2023 · Updated 2 years ago
- ☆23 · Feb 3, 2024 · Updated 2 years ago
- [ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios ☆26 · Jul 2, 2025 · Updated 11 months ago
- ☆19 · Sep 18, 2024 · Updated last year
- ☆26 · Dec 2, 2022 · Updated 3 years ago
- ☆19 · Sep 1, 2025 · Updated 9 months ago
- open source code for NeurIPS 2024 paper ☆12 · Nov 9, 2025 · Updated 7 months ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ☆17 · Nov 10, 2025 · Updated 7 months ago
- A full fledged mistral+wandb ☆13 · Aug 16, 2024 · Updated last year
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs ☆25 · May 7, 2025 · Updated last year
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023. ☆20 · Dec 25, 2023 · Updated 2 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆25 · Jul 21, 2024 · Updated last year
- ☆22 · Jan 5, 2024 · Updated 2 years ago
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆26 · Sep 26, 2024 · Updated last year
- Code and data for the FACTOR paper ☆53 · Nov 15, 2023 · Updated 2 years ago
- This project is mainly used for TCP tunneling into an internal network (this is the client) ☆16 · Oct 23, 2019 · Updated 6 years ago
- AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs ☆18 · Nov 9, 2024 · Updated last year
- The official implementation of the ACL 2022 paper "Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks" ☆34 · Jan 12, 2023 · Updated 3 years ago
- RFTT: Reasoning with Reinforced Functional Token Tuning ☆29 · Feb 12, 2026 · Updated 4 months ago
- The official github repo for the open online courses: "Dive into LLMs". ☆10 · Mar 15, 2024 · Updated 2 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling" ☆25 · Mar 28, 2024 · Updated 2 years ago
- ICLR Reproducibility Challenge: Generative Adversarial Models For Learning Private And Fair Representations ☆12 · Jan 12, 2019 · Updated 7 years ago
- Nex Venus Communication Library ☆76 · Nov 17, 2025 · Updated 6 months ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities. ☆118 · May 25, 2026 · Updated 3 weeks ago
- Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method; GKD: A General Knowledge Distillation… ☆34 · Aug 4, 2023 · Updated 2 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction" ☆10 · Oct 20, 2022 · Updated 3 years ago
- "DeepResearch-Eval: An End-to-End Evaluation Framework for DeepResearch Systems" ☆47 · Oct 16, 2025 · Updated 8 months ago
- [AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback ☆34 · Dec 16, 2025 · Updated 6 months ago
- ☆21 · Aug 19, 2024 · Updated last year
- Align, a general text alignment function ☆15 · Dec 7, 2023 · Updated 2 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs). ☆11 · May 24, 2024 · Updated 2 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection" ☆12 · Oct 20, 2024 · Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness" ☆23 · Feb 17, 2025 · Updated last year
- COLING 2025: MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity ☆29 · Dec 23, 2024 · Updated last year
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆86 · Sep 13, 2025 · Updated 9 months ago