ictnlp / TruthX
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
☆143Updated 11 months ago
Alternatives and similar repositories for TruthX:
Users that are interested in TruthX are comparing it to the libraries listed below
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆162Updated 4 months ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆189Updated 6 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆325Updated 3 weeks ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆43Updated 4 months ago
- ☆68Updated 3 months ago
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆403Updated last year
- The official code repository for PRMBench.☆68Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆160Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆107Updated 11 months ago
- ☆170Updated 8 months ago
- A Survey on Data Selection for Language Models☆218Updated 5 months ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆137Updated last month
- ☆127Updated 8 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆90Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆116Updated 4 months ago
- ☆80Updated 2 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆208Updated this week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆153Updated this week
- [TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.☆141Updated 9 months ago
- Fantastic Data Engineering for Large Language Models☆83Updated 2 months ago
- Collection of Reverse Engineering in Large Model☆32Updated 2 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆153Updated 8 months ago
- Toolkit for Prompt Compression☆249Updated last month
- A Survey of Hallucination in Large Foundation Models☆54Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆109Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆107Updated 6 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆66Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated 2 weeks ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year