thunlp / S3DeltaLinks
code for paper Sparse Structure Search for Delta Tuning
☆11Updated 3 years ago
Alternatives and similar repositories for S3Delta
Users that are interested in S3Delta are comparing it to the libraries listed below
Sorting:
- ThinK: Thinner Key Cache by Query-Driven Pruning☆27Updated 11 months ago
- ☆14Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆276Updated last week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆258Updated 5 months ago
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆123Updated 7 months ago
- ☆196Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆201Updated 2 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆153Updated 7 months ago
- ☆10Updated last year
- ☆306Updated 7 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Updated last year
- ☆57Updated 8 months ago
- ☆63Updated last year
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆13Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆51Updated 10 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆25Updated last year
- Code accompanying the paper "Massive Activations in Large Language Models"☆195Updated last year
- ☆175Updated last year
- ☆52Updated 2 years ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆98Updated 3 months ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆22Updated 3 months ago
- 📜 Paper list on decoding methods for LLMs and LVLMs☆68Updated 3 months ago
- ☆58Updated 2 years ago
- ☆204Updated last month
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated 6 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆298Updated this week
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆76Updated 11 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2025.☆659Updated this week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Updated 10 months ago