Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).
☆80 · Apr 12, 2024 · Updated last year
Alternatives and similar repositories for long_hallucinations
Users who are interested in long_hallucinations are comparing it to the libraries listed below.
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments). ☆406 · Apr 12, 2024 · Updated last year
- ☆13 · Feb 14, 2022 · Updated 4 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality ☆40 · Dec 1, 2025 · Updated 3 months ago
- ☆30 · Oct 13, 2023 · Updated 2 years ago
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection" ☆66 · Apr 11, 2025 · Updated 10 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs ☆40 · Jan 30, 2024 · Updated 2 years ago
- ☆19 · Nov 30, 2024 · Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆18 · Apr 1, 2025 · Updated 11 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆66 · Dec 10, 2024 · Updated last year
- ☆17 · Dec 21, 2023 · Updated 2 years ago
- ☆17 · Oct 16, 2024 · Updated last year
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique ☆18 · Aug 22, 2024 · Updated last year
- Uncertainty quantification for in-context learning of large language models ☆15 · Apr 1, 2024 · Updated last year
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆22 · Oct 10, 2024 · Updated last year
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller" ☆18 · Aug 13, 2024 · Updated last year
- ☆20 · Nov 3, 2024 · Updated last year
- ☆22 · Dec 9, 2023 · Updated 2 years ago
- ☆52 · Feb 12, 2025 · Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆29 · May 22, 2025 · Updated 9 months ago
- ☆26 · Nov 21, 2022 · Updated 3 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating ☆97 · Jan 29, 2024 · Updated 2 years ago
- ☆63 · Dec 6, 2024 · Updated last year
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project. ☆38 · Nov 19, 2024 · Updated last year
- ☆444 · Updated this week
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆31 · Aug 15, 2024 · Updated last year
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo… ☆397 · Aug 24, 2024 · Updated last year
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆22 · Mar 7, 2024 · Updated last year
- Study and research with your docs, media, and AI in one place ☆33 · Updated this week
- ☆46 · Sep 27, 2025 · Updated 5 months ago
- ☆32 · Jun 5, 2025 · Updated 9 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ☆572 · Jan 28, 2025 · Updated last year
- [NeurIPS 2024 D&B] Official code for "EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries" ☆41 · Jan 11, 2025 · Updated last year
- ☆35 · Jul 29, 2023 · Updated 2 years ago
- Introduction about AWESOME_ENTROPY+LRM_PAPERS ☆30 · Dec 16, 2025 · Updated 2 months ago
- (Competition) 6th -- Scene-Text-Detection-and-Recognition. ☆10 · Jun 14, 2022 · Updated 3 years ago
- The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning" ☆29 · Jan 28, 2026 · Updated last month
- ☆11 · Jun 20, 2023 · Updated 2 years ago
- ☆13 · Nov 5, 2024 · Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆85 · Feb 5, 2024 · Updated 2 years ago