thunlp / S3DeltaLinks
code for paper Sparse Structure Search for Delta Tuning
☆11Updated 2 years ago
Alternatives and similar repositories for S3Delta
Users that are interested in S3Delta are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆105Updated last week
- ☆13Updated last year
- An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.☆17Updated 4 months ago
- ☆10Updated last year
- ☆44Updated last year
- ☆179Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆45Updated 8 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆59Updated 4 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆465Updated this week
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆336Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆86Updated 8 months ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆20Updated last month
- Paper list for Efficient Reasoning.☆541Updated 2 weeks ago
- A curated reading list of research in Mixture-of-Experts(MoE).☆638Updated 8 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆233Updated last month
- This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…☆28Updated 3 months ago
- ☆23Updated last year
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆54Updated 3 months ago
- Awesome-Low-Rank-Adaptation☆110Updated 9 months ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆12Updated 6 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆68Updated 6 months ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆38Updated 3 months ago
- ☆202Updated 8 months ago
- ☆236Updated last week
- An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length…☆12Updated last month
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆147Updated this week
- This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).☆46Updated 2 years ago
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆30Updated 3 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆164Updated 2 weeks ago
- ☆49Updated last year