jlko / semantic_uncertainty
Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).
☆247Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for semantic_uncertainty
- ☆142Updated 5 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆412Updated 9 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆175Updated last month
- A Survey on Data Selection for Language Models☆183Updated last month
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆430Updated 7 months ago
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆43Updated 7 months ago
- List of papers on hallucination detection in LLMs.☆682Updated this week
- LLM hallucination paper list☆293Updated 8 months ago
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆318Updated 10 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆471Updated 4 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆144Updated last month
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆471Updated last month
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆292Updated 6 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆96Updated last month
- RewardBench: the first evaluation tool for reward models.☆437Updated last month
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆76Updated 3 months ago
- Using sparse coding to find distributed representations used by neural networks.☆188Updated last year
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆235Updated 7 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆134Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆116Updated last month
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆376Updated last month
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Updated 3 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆730Updated 3 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆220Updated this week
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆102Updated 2 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆74Updated 8 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆97Updated 7 months ago
- A Survey of Attributions for Large Language Models☆169Updated 3 months ago
- A resource repository for representation engineering in large language models☆54Updated last week
- This repository collects all relevant resources about interpretability in LLMs☆289Updated 3 weeks ago