jaewoosong / pocketnn
The official, proof-of-concept C++ implementation of PocketNN.
☆33 Updated last year
Alternatives and similar repositories for pocketnn
Users who are interested in pocketnn are comparing it to the libraries listed below.
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv ☆83 Updated 2 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 Updated last year
- GEMM and Winograd based convolutions using CUTLASS ☆26 Updated 4 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware. ☆110 Updated 6 months ago
- Fast matrix multiplication for few-bit integer matrices on CPUs. ☆29 Updated 6 years ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation. ☆25 Updated 3 years ago
- [NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference ☆11 Updated 7 months ago
- An implementation of a BinaryConnect network for cifar10 ☆11 Updated 5 years ago
- Post-training sparsity-aware quantization ☆34 Updated 2 years ago
- The Riallto Open Source Project from AMD ☆80 Updated last month
- Implementation of convolution layer in different flavors ☆68 Updated 7 years ago
- A Deep Learning Framework for the Posit Number System ☆28 Updated 10 months ago
- ☆29 Updated 4 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions ☆13 Updated 5 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms ☆11 Updated 2 years ago
- SAMO: Streaming Architecture Mapping Optimisation ☆33 Updated last year
- NEural Minimizer for pytOrch ☆43 Updated 10 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together ☆64 Updated 7 years ago
- A 8-/16-/32-/64-bit floating point number family ☆17 Updated 3 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆149 Updated last week
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution ☆17 Updated last year
- A tool to deploy Deep Neural Networks on PULP-based SoC's ☆80 Updated 3 months ago
- ☆14 Updated 5 years ago
- Lightweight C implementation of CNNs for Embedded Systems ☆61 Updated 2 years ago
- ☆11 Updated last month
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices ☆43 Updated 5 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators ☆87 Updated 2 years ago
- ☆149 Updated 2 years ago
- ColTraIn HBFP Training Emulator ☆16 Updated 2 years ago
- Awesome Quantization Paper lists with Codes ☆11 Updated 4 years ago