jadohu / LANTERN
Official Implementation of LANTERN (ICLR'25) and LANTERN++ (ICLRW-SCOPE'25)
☆16Updated 5 months ago
Alternatives and similar repositories for LANTERN
Users who are interested in LANTERN are comparing it to the libraries listed below
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Updated last year
- ☆14Updated last year
- Awesome LLM pruning papers: an all-in-one repository integrating all useful resources and insights.☆111Updated last week
- Awesome-Low-Rank-Adaptation☆115Updated 9 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆29Updated 9 months ago
- ☆50Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆103Updated 2 years ago
- Implementations for "Trimming the ℓ₁ Regularizer: Statistical Analysis, Optimization, and Applications to Deep Learning" Published on ICM…☆34Updated 4 years ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆109Updated last month
- Official code implementation of "GEX: A flexible method for approximating influence via Geometric Ensemble" (NeurIPS 2023)☆13Updated last year
- ☆28Updated 5 months ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated 2 years ago
- Code for coreset selection methods☆238Updated 2 years ago
- ☆25Updated last year
- ThinK: Thinner Key Cache by Query-Driven Pruning☆22Updated 6 months ago
- ☆57Updated 7 months ago
- ☆11Updated 9 months ago
- An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.☆21Updated this week
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆35Updated last week
- A curated list of papers with interesting empirical studies and insights on deep learning. Continually updating...☆337Updated 3 weeks ago
- GitHub repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆13Updated 3 months ago
- ☆9Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆47Updated 9 months ago
- ☆36Updated 2 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆18Updated 2 years ago
- Awesome Low-Rank Adaptation☆42Updated this week
- A curated reading list of research in Mixture-of-Experts (MoE).☆639Updated 9 months ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR 2023).☆48Updated last year
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆403Updated 2 weeks ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆174Updated last year