jlko / semantic_uncertaintyLinks
Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).
☆404Updated last year
Alternatives and similar repositories for semantic_uncertainty
Users that are interested in semantic_uncertainty are comparing it to the libraries listed below
Sorting:
- ☆183Updated last year
- ☆642Updated 6 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆552Updated 2 years ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆535Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆295Updated 3 weeks ago
- LLM hallucination paper list☆331Updated last year
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆78Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆415Updated 9 months ago
- A Survey on Data Selection for Language Models☆253Updated 9 months ago
- A Survey of Attributions for Large Language Models☆222Updated 3 weeks ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Updated last year
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆144Updated last year
- A resource repository for representation engineering in large language models☆148Updated last year
- Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models☆811Updated 8 months ago
- awesome SAE papers☆71Updated 8 months ago
- ☆519Updated 6 months ago
- This repository collects all relevant resources about interpretability in LLMs☆391Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆601Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆163Updated 2 months ago
- ☆51Updated last year
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆567Updated last year
- ☆186Updated 3 weeks ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆570Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper☆60Updated 2 months ago
- ☆42Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆65Updated 11 months ago
- ☆158Updated 2 years ago
- [ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models.☆125Updated 5 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆155Updated 5 months ago
- LLM Unlearning☆181Updated 2 years ago