Beryex / RLPruner-CNNLinks
RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration
☆24Updated 3 months ago
Alternatives and similar repositories for RLPruner-CNN
Users that are interested in RLPruner-CNN are comparing it to the libraries listed below
Sorting:
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆96Updated 2 years ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆95Updated last year
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆46Updated 11 months ago
- Post-Training Quantization for Vision transformers.☆225Updated 3 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆79Updated last year
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆31Updated 9 months ago
- [CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers☆27Updated 5 months ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆56Updated last year
- DeiT implementation for Q-ViT☆24Updated 4 months ago
- ☆22Updated last year
- ☆24Updated last year
- The official implementation of the AAAI 2024 paper Bi-ViT.☆10Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆71Updated last year
- Offical implementation of "Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-effi…☆195Updated 2 months ago
- ☆273Updated last year
- [ECCV 2024] Isomorphic Pruning for Vision Models☆77Updated last year
- The official implementation of the ICML 2023 paper OFQ-ViT☆33Updated last year
- ☆12Updated last year
- This repo contains the code for studying the interplay between quantization and sparsity methods☆23Updated 6 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆106Updated last year
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆64Updated 5 months ago
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆170Updated last year
- LLM Inference with Microscaling Format☆31Updated 10 months ago
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆37Updated last year
- ☆17Updated 11 months ago
- Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices (ACML 2023)☆14Updated last year
- super-resolution; post-training quantization; model compression☆12Updated last year
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆58Updated 2 years ago
- Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket☆70Updated 2 years ago