SensiMix: Sensitivity-Aware 8-bit Index & 1-bit Value Mixed Precision Quantization for BERT Compression (PLOS One)
☆34Aug 22, 2025Updated 6 months ago
Alternatives and similar repositories for SensiMix
Users who are interested in SensiMix are comparing it to the libraries listed below
- ☆33Dec 9, 2022Updated 3 years ago
- Pea-KD: Parameter-efficient and accurate knowledge distillation on BERT (PLOS One)☆35Aug 22, 2025Updated 6 months ago
- Vector multiplication on Low-rank Matrix Factorization☆46Nov 10, 2023Updated 2 years ago
- Flexible Convolutional Neural Network☆23Nov 15, 2023Updated 2 years ago
- SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning (ICLR 2025)☆26Feb 7, 2025Updated last year
- Edge-guided Model Inversion for Accurate Data-Free Applications☆22Nov 13, 2025Updated 3 months ago
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14May 31, 2025Updated 9 months ago
- Model-Agnostic Augmentation for Accurate Graph Classification (WWW 2022)☆21Aug 22, 2025Updated 6 months ago
- A dataset repository of "Accurate Action Recommendation for Smart Home via Two-Level Encoders and Commonsense Knowledge" (CIKM 2022)☆16Aug 20, 2025Updated 6 months ago
- Belief Propagation Network for Hard Inductive Semi-Supervised Learning (IJCAI 2019)☆22Jul 6, 2023Updated 2 years ago
- Fast and Accurate Partial Fourier Transform for Time Series Data (KDD 2021)☆16Aug 19, 2025Updated 6 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 10 months ago
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆41May 1, 2025Updated 10 months ago
- ☆12Oct 9, 2023Updated 2 years ago
- A PyTorch To Keras Model Converter☆10Aug 25, 2022Updated 3 years ago
- Coverist: an AI service for generating book covers☆13Sep 11, 2022Updated 3 years ago
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 9 months ago
- Technology for explaining atomic factual relations in generated Korean documents☆17Dec 16, 2024Updated last year
- ☆14Oct 6, 2023Updated 2 years ago
- This repo implements VGG's Comparator Network [1].☆11Sep 4, 2018Updated 7 years ago
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- PyTorch implementation of our ECCV 2022 paper: Fine-grained Data Distribution Alignment for Post-Training Quantization☆15Sep 13, 2022Updated 3 years ago
- Accurate Node Feature Estimation with Structured Variational Graph Autoencoder (KDD 2022)☆19Apr 6, 2023Updated 2 years ago
- EXIT : Extrapolation and Interpolation-based Neural Controlled Differential Equations for Time-series Classification and Forecasting☆14Oct 18, 2023Updated 2 years ago
- Image-to-image translation in PyTorch (e.g. horse2zebra, edges2cats, and more)☆16Aug 8, 2017Updated 8 years ago
- Official Implementation of "D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion"☆24Oct 29, 2023Updated 2 years ago
- ☆25Dec 18, 2023Updated 2 years ago
- ☆89Mar 28, 2024Updated last year
- A PyTorch implementation of a real XNOR-popcount (1-bit op) GEMM Linear extension, supporting both CPU and CUDA☆24Jun 6, 2023Updated 2 years ago
- torch_quantizer is an out-of-the-box quantization tool for PyTorch models on the CUDA backend, specially optimized for diffusion models.☆24Mar 29, 2024Updated last year
- ☆45Jun 25, 2025Updated 8 months ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Aug 15, 2024Updated last year
- An implementation on "Curved-Voxel Clustering for Accurate Segmentation of 3D LiDAR Point Clouds with Real-Time Performance" from IROS 20…☆241Jan 25, 2022Updated 4 years ago
- Attentive Co-Evolving Neural Ordinary Differential Equations☆30Oct 16, 2023Updated 2 years ago
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆42Dec 9, 2024Updated last year
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆44Apr 18, 2025Updated 10 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆67Dec 1, 2025Updated 3 months ago
- ☆43Oct 31, 2024Updated last year
- [ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks☆39Feb 4, 2025Updated last year