Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules
☆43Sep 19, 2022Updated 3 years ago
Alternatives and similar repositories for qsparse
Users that are interested in qsparse are comparing it to the libraries listed below
Sorting:
- yolov5_ncnn in ununtu16.04☆10Nov 2, 2020Updated 5 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Mar 19, 2023Updated 2 years ago
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆16Dec 29, 2024Updated last year
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Jan 5, 2021Updated 5 years ago
- ☆16Apr 1, 2022Updated 3 years ago
- nanodet_rknn on rk3399pro platform☆17Apr 17, 2022Updated 3 years ago
- Simple pytorch classification baselines for MNIST, CIFAR and ImageNet☆19Aug 7, 2019Updated 6 years ago
- argparse extension for hpman☆17Dec 4, 2022Updated 3 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆160Jun 16, 2020Updated 5 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Nov 21, 2023Updated 2 years ago
- Deep Learning Model Optimization Using by TensorRT API, window☆16Aug 29, 2022Updated 3 years ago
- ☆23Oct 24, 2022Updated 3 years ago
- ☆17Aug 9, 2021Updated 4 years ago
- 使用TensorRT部署SlowFast模型☆24Mar 2, 2022Updated 4 years ago
- Object detection and instance segmentation on MaskRCNN with torchvision, albumentations, tensorboard and cocoapi. Supports custom coco da…☆18Sep 28, 2020Updated 5 years ago
- PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation☆23Feb 17, 2020Updated 6 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- ☆17Mar 9, 2020Updated 5 years ago
- Pytorch implementation of RAPQ, IJCAI 2022☆23Jul 19, 2023Updated 2 years ago
- yolov5部署☆20Jun 17, 2022Updated 3 years ago
- ☆26Mar 1, 2024Updated 2 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- Develop and research with PyTorch more easily.☆25Oct 12, 2018Updated 7 years ago
- ☆28Nov 29, 2022Updated 3 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆37Aug 20, 2024Updated last year
- 基于Point Transformers复现点云分割任务,并使用HAQ算法进行自动量化压缩,几乎不影响精度☆26Aug 25, 2022Updated 3 years ago
- OpenVINO™ optimization for PointPillars*☆31May 5, 2025Updated 9 months ago
- SaccadeNet : mimic how human locate accurate bounding box☆29Jul 10, 2019Updated 6 years ago
- A simple implementation of EfficientDet based on Detectron2 framework☆27Nov 3, 2020Updated 5 years ago
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆49Updated this week
- provide some new architecture, channel pruning and quantization methods for yolov5☆31Oct 13, 2025Updated 4 months ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Aug 17, 2022Updated 3 years ago
- Pytorch implementation of Coherent Semantic Attention Image Inpainting☆26Nov 28, 2019Updated 6 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Sep 12, 2023Updated 2 years ago
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"☆129Jan 2, 2020Updated 6 years ago
- ☆79Jul 21, 2022Updated 3 years ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago