jaewoosong / pocketnnLinks

The official, proof-of-concept C++ implementation of PocketNN.

☆35

Alternatives and similar repositories for pocketnn

Users that are interested in pocketnn are comparing it to the libraries listed below

Sorting:

wangmaolin / niti
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv
☆86Updated 3 years ago
maltanar / gemmbitserial
Fast matrix multiplication for few-bit integer matrices on CPUs.
☆28Updated 6 years ago
eliberis / uNAS
μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.
☆81Updated 4 years ago
fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆161Updated this week
pulp-platform / nemo
NEural Minimizer for pytOrch
☆45Updated last year
kunglab / ebnn
Header-only C library for Binary Neural Network Feedforward Inference (targeting small devices)
☆47Updated 3 years ago
gplhegde / convolution-flavors
Implementation of convolution layer in different flavors
☆68Updated 8 years ago
jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Updated 2 years ago
IntelLabs / FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
☆111Updated 10 months ago
areusch / microtvm-blogpost-eval
☆29Updated 4 years ago
RaulMurillo / deep-pensieve
A Deep Learning Framework for the Posit Number System
☆30Updated last year
EEESlab / CMix-NN
CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices
☆47Updated 5 years ago
pulp-platform / dory
A tool to deploy Deep Neural Networks on PULP-based SoC's
☆87Updated 2 months ago
fastconvnets / cvpr2020
Code for "Fast Sparse ConvNets" CVPR2020 submissions
☆12Updated 5 years ago
Xilinx / finn-base
Open Source Compiler Framework using ONNX as Frontend and IR
☆33Updated 3 years ago
changwoolee / BLAST
[NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference
☆14Updated 11 months ago
HazyResearch / butterfly
Butterfly matrix multiplication in PyTorch
☆174Updated 2 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
plumerai / rethinking-bnn-optimization
Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"
☆74Updated 5 years ago
larq / zoo
Reference implementations of popular Binarized Neural Networks
☆108Updated 2 weeks ago
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20Updated 3 years ago
AcrossV / Gated-XNOR
Ternary Weights and Activations
☆24Updated 7 years ago
AlexMontgomerie / samo
SAMO: Streaming Architecture Mapping Optimisation
☆34Updated 2 years ago
ucb-bar / onnxruntime-riscv
Fork of upstream onnxruntime focused on supporting risc-v accelerators
☆87Updated 2 years ago
mrusci / training-mixed-precision-quantized-networks
This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆50Updated last year
sfox14 / block_minifloat
Training with Block Minifloat number representation
☆16Updated 4 years ago
maltanar / qnn-inference-examples
Jupyter notebook examples on image classification with quantized neural networks
☆69Updated 5 years ago
benja263 / Integer-Only-Inference-for-Deep-Learning-in-Native-C
Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.
☆25Updated 3 years ago
zhutmost / neuralzip
A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…
☆25Updated 2 years ago
1adrianb / expert-binary-networks
Code for High-Capacity Expert Binary Networks (ICLR 2021).
☆27Updated 3 years ago