Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]
☆17Aug 7, 2024Updated last year
Alternatives and similar repositories for SNP
Users that are interested in SNP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official NetsPresso Python package.☆48Nov 20, 2025Updated 4 months ago
- ☆29Dec 5, 2023Updated 2 years ago
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆90Sep 13, 2024Updated last year
- Korean Text Data Generator for OCR tasks.☆10Aug 20, 2020Updated 5 years ago
- HSViT: Horizontally Scalable Vision Transformer☆13Nov 6, 2024Updated last year
- Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers, Paper accepted at eXCV workshop of ECCV 2…☆30Jan 6, 2025Updated last year
- A library for training, compressing and deploying computer vision models (including ViT) with edge devices☆74Sep 29, 2025Updated 5 months ago
- Gathers data science and machine learning problem solving using PySpark and Hadoop.☆11Jan 22, 2019Updated 7 years ago
- ☆13Jul 3, 2024Updated last year
- ☆13Sep 24, 2023Updated 2 years ago
- A PyTorch Implementation for experiements in paper: Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge.☆17May 29, 2023Updated 2 years ago
- ☆14Apr 25, 2023Updated 2 years ago
- ☆12May 5, 2023Updated 2 years ago
- ☆14May 3, 2024Updated last year
- This repository contains the supplementary material for the paper titled: "TRANSFORMER COMPRESSED SENSING VIA GLOBAL IMAGE TOKENS".☆13Dec 20, 2022Updated 3 years ago
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 5 months ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆55Dec 1, 2023Updated 2 years ago
- The implementation of paper : RTCoInfer: Real-time Edge-Cloud Collaborative CNN Inference for Stream Analytics on Ubiquitous Images☆17Oct 18, 2022Updated 3 years ago
- Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21)☆10Jan 19, 2022Updated 4 years ago
- This repository provides a collection of LaTeX class templates designed to enhance the clarity and conciseness of the main.tex files. It …☆13Nov 13, 2025Updated 4 months ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆70Jan 6, 2024Updated 2 years ago
- A collection of LaTeX Templates used by my own.☆11Apr 20, 2022Updated 3 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆82Jul 23, 2024Updated last year
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆53Sep 29, 2022Updated 3 years ago
- MINT, Multiplier-less INTeger Quantization for Energy Efficient Spiking Neural Networks, ASP-DAC 2024, Nominated for Best Paper Award☆16Apr 12, 2024Updated last year
- [NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu,…☆19Dec 13, 2024Updated last year
- ☆57Jun 10, 2024Updated last year
- [ICCV 2023] Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation☆23Dec 12, 2023Updated 2 years ago
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Nov 5, 2023Updated 2 years ago
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆21Aug 13, 2025Updated 7 months ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"☆19Jul 24, 2024Updated last year
- Named Entity Recognition via Attention_based CNNs-BiLSTm-CRF☆15Jun 27, 2018Updated 7 years ago
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆313Jul 6, 2024Updated last year
- ☆21Oct 1, 2024Updated last year
- 在千问最新的多模态image-text模型Qwen3-VL-4B-Instruct 进行多种lora微调对比效果,通过langchain+RAG+多智能体(Multi-Agent)进行部署☆32Dec 14, 2025Updated 3 months ago
- [ICLR 2023] Pruning Deep Neural Networks from a Sparsity Perspective☆25Jan 8, 2024Updated 2 years ago
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆18Dec 6, 2023Updated 2 years ago
- An LLM-powered chatbot for fediverse. A tech demo for BotKit.☆14Dec 20, 2025Updated 3 months ago
- Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition (COLING2024)☆17Jun 18, 2025Updated 9 months ago