thunlp / S3DeltaLinks
code for paper Sparse Structure Search for Delta Tuning
☆11Updated 3 years ago
Alternatives and similar repositories for S3Delta
Users that are interested in S3Delta are comparing it to the libraries listed below
Sorting:
- ThinK: Thinner Key Cache by Query-Driven Pruning☆24Updated 8 months ago
- ☆187Updated last year
- ☆161Updated last year
- ☆11Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆259Updated last year
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆111Updated 3 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆94Updated last year
- 📜 Paper list on decoding methods for LLMs and LVLMs☆61Updated 4 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆582Updated last week
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆186Updated 4 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆252Updated 2 months ago
- ☆278Updated 3 months ago
- ☆53Updated 2 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated 3 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆184Updated last week
- Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.☆129Updated 2 months ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 3 months ago
- ☆212Updated 7 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆496Updated last year
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆69Updated 6 months ago
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆44Updated 6 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆51Updated last year
- Paper list for Efficient Reasoning.☆709Updated this week
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Updated 2 years ago
- ☆54Updated 10 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆15Updated last month
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆174Updated last week
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆309Updated last week
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆356Updated 2 years ago
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆24Updated 3 months ago