Tencent/PocketFlow

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Tencent/PocketFlow)

Tencent / PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

☆2,908

Alternatives and similar repositories for PocketFlow

Users that are interested in PocketFlow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pytorch / QNNPACK
View on GitHub
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,550Aug 28, 2019Updated 6 years ago
ethanhe42 / channel-pruning
View on GitHub
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
☆1,088May 2, 2024Updated 2 years ago
Tencent / FeatherCNN
View on GitHub
FeatherCNN is a high performance inference engine for convolutional neural networks.
☆1,228Sep 24, 2019Updated 6 years ago
XiaoMi / mace
View on GitHub
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
☆5,042Jun 17, 2024Updated 2 years ago
microsoft / MMdnn
View on GitHub
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…
☆5,806Aug 7, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tensorflow / adanet
View on GitHub
Fast and flexible AutoML with learning guarantees.
☆3,456Nov 30, 2023Updated 2 years ago
BUG1989 / caffe-int8-convert-tools
View on GitHub
Generate a quantization parameter file for ncnn framework int8 inference
☆517Jul 29, 2020Updated 5 years ago
Eric-mingjie / rethinking-network-pruning
View on GitHub
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
☆1,512Jun 7, 2020Updated 6 years ago
Tencent / ncnn
View on GitHub
ncnn is a high-performance neural network inference framework optimized for the mobile platform
☆23,554Jul 13, 2026Updated last week
sun254 / awesome-model-compression-and-acceleration
View on GitHub
a list of awesome papers on deep model ompression and acceleration
☆348Jun 19, 2021Updated 5 years ago
microsoft / nni
View on GitHub
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…
☆14,364Jul 3, 2024Updated 2 years ago
mit-han-lab / proxylessnas
View on GitHub
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
☆1,446Aug 30, 2024Updated last year
Tencent / tencent-ml-images
View on GitHub
Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet
☆3,067Apr 20, 2022Updated 4 years ago
alibaba / MNN
View on GitHub
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
☆15,681Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
666DZY666 / micronet
View on GitHub
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…
☆2,266May 6, 2025Updated last year
mit-han-lab / amc-models
View on GitHub
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆168Feb 26, 2021Updated 5 years ago
apache / tvm
View on GitHub
Open Machine Learning Compiler Framework
☆13,588Updated this week
he-y / soft-filter-pruning
View on GitHub
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
☆385Oct 2, 2019Updated 6 years ago
facebookresearch / maskrcnn-benchmark
View on GitHub
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
☆9,365Feb 16, 2023Updated 3 years ago
miaow1988 / ShuffleNet_V2_pytorch_caffe
View on GitHub
ShuffleNet-V2 for both PyTorch and Caffe.
☆504Aug 9, 2018Updated 7 years ago
guan-yuan / Awesome-AutoML-and-Lightweight-Models
View on GitHub
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…
☆856Jun 19, 2021Updated 5 years ago
D-X-Y / AutoDL-Projects
View on GitHub
Automated deep learning algorithms implemented in PyTorch.
☆1,580Apr 24, 2022Updated 4 years ago
JiahuiYu / slimmable_networks
View on GitHub
Slimmable Networks, AutoSlim, and Beyond, ICLR 2019, and ICCV 2019
☆928Mar 9, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tensorpack / tensorpack
View on GitHub
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
☆6,288Aug 6, 2023Updated 2 years ago
ysh329 / deep-learning-model-convertor
View on GitHub
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
☆3,239Jun 26, 2023Updated 3 years ago
antspy / quantized_distillation
View on GitHub
Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"
☆336Jul 25, 2024Updated last year
Tencent / TNN
View on GitHub
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is …
☆4,639May 9, 2025Updated last year
quark0 / darts
View on GitHub
Differentiable architecture search for convolutional and recurrent networks
☆3,998Jan 3, 2021Updated 5 years ago
facebookresearch / kill-the-bits
View on GitHub
Code for: "And the bit goes down: Revisiting the quantization of neural networks"
☆630Nov 9, 2020Updated 5 years ago
JDAI-CV / dabnn
View on GitHub
dabnn is an accelerated binary neural networks inference framework for mobile platform
☆774Nov 12, 2019Updated 6 years ago
Rock-100 / FaceKit
View on GitHub
[CVPR 2018] Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks
☆1,083May 11, 2023Updated 3 years ago
google / gemmlowp
View on GitHub
Low-precision matrix multiplication
☆1,843Jan 29, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
megvii-model / ShuffleNet-Series
View on GitHub
☆1,513Aug 27, 2020Updated 5 years ago
SCUT-AILab / DCP
View on GitHub
Code for “Discrimination-aware-Channel-Pruning-for-Deep-Neural-Networks”
☆183Oct 29, 2020Updated 5 years ago
Robert-JunWang / Pelee
View on GitHub
Pelee: A Real-Time Object Detection System on Mobile Devices
☆884Jan 4, 2019Updated 7 years ago
jacobgil / pytorch-pruning
View on GitHub
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
☆885Jul 12, 2019Updated 7 years ago
mit-han-lab / once-for-all
View on GitHub
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
☆1,953Dec 14, 2023Updated 2 years ago
horovod / horovod
View on GitHub
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
☆14,695Jun 20, 2026Updated last month
shicai / MobileNet-Caffe
View on GitHub
Caffe Implementation of Google's MobileNets (v1 and v2)
☆1,272Jun 8, 2021Updated 5 years ago