MANGA-UOFA / PTferLinks
☆11Updated 11 months ago
Alternatives and similar repositories for PTfer
Users that are interested in PTfer are comparing it to the libraries listed below
Sorting:
- Multi-GPU supported kmeans clustering for cluser-clip☆14Updated last year
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Updated 3 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆37Updated 2 years ago
- ☆35Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆22Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing☆100Updated last year
- ☆14Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Updated 2 years ago
- ☆86Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆83Updated last year
- ☆57Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆75Updated 11 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated 2 years ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated 4 months ago
- ☆54Updated last year
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆46Updated 4 months ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆54Updated last year
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆39Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75Updated 5 months ago
- Retrieval as Attention☆82Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆13Updated 3 years ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆52Updated last year
- Personality Alignment of Language Models☆47Updated 4 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Updated last year
- ☆74Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Updated 2 years ago