jlko / long_hallucinationsView external linksLinks
Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).
☆78Apr 12, 2024Updated last year
Alternatives and similar repositories for long_hallucinations
Users that are interested in long_hallucinations are comparing it to the libraries listed below
Sorting:
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).☆404Apr 12, 2024Updated last year
- ☆13Feb 14, 2022Updated 4 years ago
- ☆29Oct 13, 2023Updated 2 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 2 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆40Jan 30, 2024Updated 2 years ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆18Apr 1, 2025Updated 10 months ago
- ☆19Nov 30, 2024Updated last year
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆66Dec 10, 2024Updated last year
- ☆16Oct 16, 2024Updated last year
- ☆17Dec 21, 2023Updated 2 years ago
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆18Aug 22, 2024Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated last year
- ☆20Nov 3, 2024Updated last year
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆21Oct 10, 2024Updated last year
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- Repository for the paper 'Enhancing Clinical Decision Support with Physiological Waveforms — A Multimodal Benchmark in Emergency Care'.☆22Apr 30, 2025Updated 9 months ago
- About The official GitHub page for ''Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with …☆29Dec 12, 2024Updated last year
- ☆52Feb 12, 2025Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 8 months ago
- ☆26Nov 21, 2022Updated 3 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 2 years ago
- ☆63Dec 6, 2024Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated last year
- ☆37Aug 21, 2025Updated 5 months ago
- ☆71Jan 28, 2026Updated 2 weeks ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- ☆435Feb 3, 2026Updated last week
- ☆46Sep 27, 2025Updated 4 months ago
- Rethinking the User Interface of AI☆28Updated this week
- ☆32Jun 5, 2025Updated 8 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆570Jan 28, 2025Updated last year
- ☆42Dec 9, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- (Competition) 6th -- Scene-Text-Detection-and-Recognition.☆10Jun 14, 2022Updated 3 years ago
- [NeurIPS 2024 D&B] Official code for "EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries"☆40Jan 11, 2025Updated last year
- ☆36Jul 29, 2023Updated 2 years ago
- The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning"☆29Jan 28, 2026Updated 2 weeks ago
- code for modular summarization work published in ACL2021 by Krishna et al☆30Nov 4, 2021Updated 4 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Feb 5, 2024Updated 2 years ago