RUCAIBox / Language-Specific-Neurons
☆67 Updated 2 months ago
Alternatives and similar repositories for Language-Specific-Neurons:
Users who are interested in Language-Specific-Neurons are comparing it to the repositories listed below.
- ☆73 Updated 9 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆63 Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 Updated last year
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models ☆30 Updated 9 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆17 Updated 3 months ago
- Language Imbalance Driven Rewarding for Multilingual Self-improving ☆15 Updated 4 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism? ☆29 Updated 4 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆106 Updated 6 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" ☆28 Updated 4 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆107 Updated 11 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation ☆20 Updated last week
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts ☆17 Updated 6 months ago
- BeHonest: Benchmarking Honesty in Large Language Models ☆31 Updated 7 months ago
- ☆41 Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆66 Updated 2 years ago
- ☆52 Updated 6 months ago
- GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆58 Updated last year
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression" ☆10 Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆70 Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and… ☆37 Updated 4 months ago
- ☆17 Updated last year
- ☆38 Updated 4 months ago
- Safety-J: Evaluating Safety with Critique ☆16 Updated 7 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆38 Updated 5 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆73 Updated 2 months ago
- ☆16 Updated last year
- ☆35 Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated last week
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆50 Updated 3 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆54 Updated 7 months ago