deeplearning-wisc / haloscope

Source code for the NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"

☆64

Alternatives and similar repositories for haloscope

Users who are interested in haloscope are comparing it to the repositories listed below.

AlexanderVNikitin / kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
☆32 Updated 11 months ago
javiferran / sae_entities
☆66 Updated 9 months ago
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆63 Updated last year
MiaoXiong2320 / llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆137 Updated last year
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆85 Updated last year
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆65 Updated last year
ZFancy / awesome-activation-engineering
A curated list of resources for activation engineering
☆117 Updated 2 months ago
princeton-nlp / benign-data-breaks-safety
☆42 Updated last year
UCSB-NLP-Chang / llm_uncertainty
☆40 Updated last year
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆59 Updated last year
deeplearning-wisc / picle
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
☆26 Updated last year
zlin7 / UQ-NLG
☆103 Updated last year
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13 Updated last year
cognizant-ai-labs / semantic-density-paper
This repo contains the source code for reproducing the experimental results in the semantic density paper (NeurIPS 2024)
☆17 Updated 2 months ago
ethz-spylab / unlearning-vs-safety
☆25 Updated last year
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆78 Updated last year
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆37 Updated 6 months ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆97 Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆82 Updated 11 months ago
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆37 Updated 4 months ago
jaechan-repo / muse_bench
☆29 Updated last year
uw-nsl / safechain
[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
☆26 Updated 8 months ago
lorenzkuhn / semantic_uncertainty
☆180 Updated last year
UCSB-NLP-Chang / causal_unlearn
[EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"
☆32 Updated last year
Vaidehi99 / InfoDeletionAttacks
☆48 Updated 10 months ago
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆50 Updated 6 months ago
JacksonWuxs / UsableXAI_LLM
Using Explanations as a Tool for Advanced LLMs
☆69 Updated last year
amazon-science / adaptive-in-context-learning
AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection
☆18 Updated 2 years ago
tmlr-group / AR-Bench
[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"
☆47 Updated 2 months ago
dannyallover / overthinking_the_truth
☆29 Updated last year