OATML / semantic-entropy-probesLinks
☆32Updated 10 months ago
Alternatives and similar repositories for semantic-entropy-probes
Users that are interested in semantic-entropy-probes are comparing it to the libraries listed below
Sorting:
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆59Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆58Updated 6 months ago
- ☆51Updated last month
- ☆50Updated last year
- ☆70Updated 4 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆111Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆77Updated 5 months ago
- LoFiT: Localized Fine-tuning on LLM Representations☆39Updated 4 months ago
- ☆94Updated last year
- ☆44Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆45Updated last month
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆76Updated last year
- ☆89Updated 11 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆92Updated this week
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆57Updated 3 months ago
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆25Updated 8 months ago
- ☆166Updated 11 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆68Updated last year
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆60Updated last year
- ☆40Updated 3 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- ☆40Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆111Updated 8 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆167Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆25Updated 3 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆35Updated 9 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆34Updated 6 months ago
- ☆29Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆117Updated 6 months ago