thunlp / S3DeltaLinks
code for paper Sparse Structure Search for Delta Tuning
☆11Updated 2 years ago
Alternatives and similar repositories for S3Delta
Users that are interested in S3Delta are comparing it to the libraries listed below
Sorting:
- ThinK: Thinner Key Cache by Query-Driven Pruning☆23Updated 6 months ago
- ☆10Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆43Updated last month
- An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.☆17Updated 6 months ago
- ☆48Updated last year
- ☆14Updated last year
- ☆13Updated last month
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated last month
- ☆51Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆88Updated 10 months ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆20Updated 3 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆49Updated 10 months ago
- ☆163Updated 3 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆510Updated this week
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆110Updated last month
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆67Updated 6 months ago
- An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length…☆12Updated last week
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆35Updated last year
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆40Updated 4 months ago
- ☆23Updated 2 years ago
- Paper list for Efficient Reasoning.☆614Updated this week
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆20Updated last month
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting☆21Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆153Updated 2 weeks ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆13Updated 7 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆244Updated 2 weeks ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆80Updated 2 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆104Updated 2 years ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆174Updated 2 months ago
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications☆83Updated 5 months ago