THUDM / Efficient-Head-Finetuning
Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head
☆14Updated 2 years ago
Alternatives and similar repositories for Efficient-Head-Finetuning:
Users that are interested in Efficient-Head-Finetuning are comparing it to the libraries listed below
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆72Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆45Updated 2 months ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆37Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated last year
- ☆23Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 9 months ago
- A curated list of resources about long-context in large-language models and video understanding.☆30Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated 10 months ago
- ☆98Updated 5 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆48Updated last year
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆40Updated 2 years ago
- The code and data for the paper JiuZhang3.0☆43Updated 10 months ago
- Released code for our ICLR23 paper.☆64Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆61Updated 4 months ago
- ☆15Updated 8 months ago
- ☆22Updated 5 months ago
- [EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence Representation Learning. https://arxiv.org/abs/2210.16536☆39Updated 2 years ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆67Updated last year
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆24Updated 7 months ago
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆31Updated last year
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 4 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆73Updated 9 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 8 months ago
- ☆33Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆50Updated 4 months ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆129Updated last year
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆60Updated 4 months ago