thunlp / DeltaPapers
Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.
β282Updated last year
Alternatives and similar repositories for DeltaPapers:
Users that are interested in DeltaPapers are comparing it to the libraries listed below
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)β527Updated 3 years ago
- Paper List for In-context Learning π·β182Updated last year
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)β1,027Updated 7 months ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Modelβ¦β269Updated 2 years ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ162Updated last year
- Papers and Datasets on Instruction Tuning and Following. β¨β¨β¨β492Updated last year
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or lβ¦β279Updated last year
- Paper collections of retrieval-based (augmented) language model.β232Updated 11 months ago
- Awesome papers on Language-Model-as-a-Service (LMaaS)β556Updated 11 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningβ442Updated 6 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).β320Updated last year
- β345Updated 3 years ago
- β175Updated 9 months ago
- Collaborative Training of Large Language Models in an Efficient Wayβ415Updated 8 months ago
- [SIGIR'24] The official implementation code of MOELoRA.β162Updated 9 months ago
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasetsβ323Updated last year
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`β175Updated 5 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Modelsβ206Updated last year
- Collection of training data management explorations for large language modelsβ322Updated 9 months ago
- Paper List for In-context Learning π·β854Updated 7 months ago
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"β237Updated 2 years ago
- A Survey on Data Selection for Language Modelsβ228Updated last week
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarksβ261Updated 9 months ago
- Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.β319Updated last year
- A paper & resource list of large language models, including course, paper, demo, figuresβ198Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsβ261Updated 7 months ago
- β397Updated 3 years ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignmentβ327Updated last year
- [NIPS2023] RRHF & Wombatβ807Updated last year
- β319Updated 9 months ago