141forever / DiaHalu
This is the repository for the paper DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models (EMNLP2024 findings)
☆13Updated last month
Alternatives and similar repositories for DiaHalu:
Users that are interested in DiaHalu are comparing it to the libraries listed below
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆33Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆160Updated 11 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆76Updated 11 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆105Updated 4 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆17Updated last month
- ☆38Updated last year
- ☆71Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆100Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆63Updated last week
- ☆34Updated 2 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 5 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆14Updated 11 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 9 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆62Updated 10 months ago
- ☆23Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 5 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆15Updated 7 months ago
- SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆14Updated 3 weeks ago
- [EMNLP 2024 Findings] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆25Updated 2 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆50Updated 9 months ago
- ☆17Updated 11 months ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆19Updated 7 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆13Updated last month
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆14Updated 7 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆25Updated last year
- UniGen: A Unified Framework for Dataset Generation via Large Language Model☆38Updated last month
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆35Updated 10 months ago
- ☆12Updated last year
- Code for "Learning to Edit: Aligning LLMs with Knowledge Editing (ACL 2024)"☆31Updated 5 months ago