☆63Dec 15, 2024Updated last year
Alternatives and similar repositories for LoRAPrune
Users who are interested in LoRAPrune are comparing it to the libraries listed below.
- Official implementation for LaCo (EMNLP 2024 Findings)☆22Oct 3, 2024Updated last year
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆100Nov 25, 2024Updated last year
- A simple and effective LLM pruning approach.☆867Aug 9, 2024Updated last year
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference☆48Jun 4, 2024Updated 2 years ago
- ☆19Dec 7, 2025Updated 6 months ago
- [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…☆1,129Oct 7, 2024Updated last year
- A block pruning framework for LLMs.☆28May 17, 2025Updated last year
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆32Mar 28, 2024Updated 2 years ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆76Jan 6, 2024Updated 2 years ago
- Code for RepNAS☆14Dec 21, 2021Updated 4 years ago
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆51Apr 9, 2024Updated 2 years ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆641Mar 4, 2024Updated 2 years ago
- ☆17Jul 25, 2023Updated 2 years ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆65Apr 12, 2026Updated 2 months ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- Are gradient information useful for pruning of LLMs?☆48Aug 23, 2025Updated 10 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆107Feb 26, 2024Updated 2 years ago
- For releasing code related to compression methods for transformers, accompanying our publications☆462Jan 16, 2025Updated last year
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆47Nov 19, 2025Updated 7 months ago
- [TMLR 2026] Is Oracle Pruning the True Oracle?☆26Jun 20, 2026Updated 2 weeks ago
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆43May 1, 2025Updated last year
- ☆12Oct 9, 2023Updated 2 years ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆25Mar 16, 2025Updated last year
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆62Apr 5, 2025Updated last year
- ☆23Nov 26, 2024Updated last year
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆18Apr 24, 2025Updated last year
- NCTU(NYCU) Deep Learning and Practice Spring 2021☆11Jun 21, 2022Updated 4 years ago
- A Survey on Vulnerability of Federated Learning: An Algorithm Perspective☆18May 30, 2024Updated 2 years ago
- A federated image segmentation method based on style transfer☆16Sep 28, 2024Updated last year
- ☆24Oct 14, 2022Updated 3 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆22Jul 20, 2024Updated last year
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆48Oct 10, 2024Updated last year
- [ICCV2023 Official PyTorch code] for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution☆29Mar 10, 2024Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- Vico: Compositional Video Generation as Flow Equalization☆59Nov 15, 2024Updated last year