thunlp / DeltaPapers
Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.
β272Updated last year
Related projects: β
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)β510Updated 2 years ago
- Paper List for In-context Learning π·β164Updated 6 months ago
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or lβ¦β274Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningβ337Updated 2 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).β253Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignmentβ190Updated 4 months ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)β978Updated last year
- Paper collections of retrieval-based (augmented) language model.β228Updated 3 months ago
- Papers and Datasets on Instruction Tuning and Following. β¨β¨β¨β449Updated 5 months ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Modelβ¦β258Updated last year
- Paper List for In-context Learning π·β783Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ144Updated 7 months ago
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>β323Updated 4 months ago
- β334Updated 3 years ago
- β310Updated 2 months ago
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasetsβ292Updated 8 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarksβ239Updated last month
- An Awesome Collection for LLM Surveyβ286Updated last week
- Collaborative Training of Large Language Models in an Efficient Wayβ405Updated 3 weeks ago
- A curated reading list of research in Mixture-of-Experts(MoE).β520Updated last year
- Collection of training data management explorations for large language modelsβ264Updated last month
- β139Updated 2 months ago
- awesome papers in LLM interpretabilityβ235Updated 3 weeks ago
- A paper & resource list of large language models, including course, paper, demo, figuresβ184Updated last year
- A paper list about diffusion models for natural language processing.β170Updated last year
- Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"β195Updated 7 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Futureβ275Updated 2 months ago
- LLM hallucination paper listβ268Updated 6 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moβ¦β281Updated last week
- Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.β315Updated last year