THU-KEG / Skill-Neuron
Source code for EMNLP 2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".
☆18 · Updated 2 years ago
Alternatives and similar repositories for Skill-Neuron
Users who are interested in Skill-Neuron are comparing it to the repositories listed below
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆52 · Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆34 · Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆60 · Updated last month
- Function Vectors in Large Language Models (ICLR 2024) ☆179 · Updated 5 months ago
- ☆30 · Updated last year
- ☆21 · Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆43 · Updated 5 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning ☆20 · Updated 3 months ago
- ☆52 · Updated 5 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models ☆59 · Updated 9 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆68 · Updated 2 years ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners" ☆111 · Updated 2 years ago
- Learning adapter weights from task descriptions ☆19 · Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆38 · Updated last year
- ☆29 · Updated last year
- ☆38 · Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆118 · Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆33 · Updated last year
- ☆44 · Updated last year
- ☆97 · Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆61 · Updated last year
- [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning" ☆96 · Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts" (EMNLP 2023) ☆38 · Updated last year
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆40 · Updated 2 years ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆108 · Updated last year
- Lightweight Adapting for Black-Box Large Language Models ☆23 · Updated last year
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆36 · Updated 3 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆69 · Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ☆55 · Updated 2 years ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆100 · Updated 2 months ago