benja263 / Integer-Only-Inference-for-Deep-Learning-in-Native-C

Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.
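As a rough illustration of the two ideas named in the description, the sketch below quantizes floats to int8 with a uniform scale and rescales an int32 accumulator with a fixed-point multiplier. It is not taken from the repository: the function names, the symmetric [-127, 127] range, and the Q16 format are assumptions chosen for this example.

```c
#include <stdint.h>

/* Hypothetical helper: symmetric uniform quantization of a float to int8.
 * Assumes scale > 0; values are clamped to [-127, 127]. */
static int8_t quantize_i8(float x, float scale) {
    float r = x / scale;
    if (r > 127.0f) r = 127.0f;
    if (r < -127.0f) r = -127.0f;
    /* Round half away from zero without needing <math.h>. */
    return (int8_t)(r >= 0.0f ? r + 0.5f : r - 0.5f);
}

/* Hypothetical helper: encode a non-negative real multiplier as Q16
 * fixed point (16 fractional bits). Assumes 0 <= m < 32768. */
static int32_t to_fixed_q16(float m) {
    return (int32_t)(m * 65536.0f + 0.5f);
}

/* Integer-only dot product of two int8 vectors with an int32 accumulator. */
static int32_t dot_i8(const int8_t *a, const int8_t *b, int n) {
    int32_t acc = 0;
    for (int i = 0; i < n; ++i) {
        acc += (int32_t)a[i] * (int32_t)b[i];
    }
    return acc;
}

/* Rescale an int32 accumulator by a Q16 multiplier and clamp to int8.
 * Uses division (truncates toward zero in C99) instead of shifting a
 * possibly negative value, so the result is well-defined. */
static int8_t requantize_q16(int32_t acc, int32_t m_q16) {
    int64_t prod = (int64_t)acc * (int64_t)m_q16;
    int64_t half = 1 << 15;
    int64_t q = (prod + (prod >= 0 ? half : -half)) / 65536;
    if (q > 127) q = 127;
    if (q < -127) q = -127;
    return (int8_t)q;
}
```

In this sketch, the float operations in `quantize_i8` and `to_fixed_q16` would run offline when the model is converted; inference itself would use only `dot_i8` and `requantize_q16`, which need no floating-point support on the target.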

☆26

Alternatives and similar repositories for Integer-Only-Inference-for-Deep-Learning-in-Native-C

Users who are interested in Integer-Only-Inference-for-Deep-Learning-in-Native-C compare it to the repositories listed below.

pulp-platform / quantlab
☆37 Updated last year
tum-ei-eda / muriscv-nn
muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.
☆87 Updated 3 weeks ago
ucb-bar / onnxruntime-riscv
Fork of upstream onnxruntime focused on supporting RISC-V accelerators
☆87 Updated 2 years ago
tum-ei-eda / mlonmcu
Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends
☆34 Updated last week
LCAI-TIHU / SW
LCAI-TIHU SW is a software stack of the AI inference processor based on RISC-V
☆23 Updated 2 years ago
pulp-platform / dory
A tool to deploy Deep Neural Networks on PULP-based SoCs
☆88 Updated 2 months ago
nycu-caslab / TinyTS
This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.
☆19 Updated 2 months ago
pulp-platform / pulp-trainlib
Floating-Point Optimized On-Device Learning Library for the PULP Platform.
☆37 Updated 2 weeks ago
WuDan0399 / Integrate-NVDLA-and-TVM
☆33 Updated 2 years ago
e-dupuis / awesome-approximate-dnn
Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment
☆27 Updated last year
EEESlab / CMix-NN
CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices
☆48 Updated 5 years ago
wangmaolin / niti
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv
☆86 Updated 3 years ago
fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆161 Updated this week
SameLight / ITRI-OpenDLA
Express DLA implementation for FPGA, revised based on NVDLA.
☆10 Updated 6 years ago
twaclaw / matmult
A floating-point matrix multiplication implemented in hardware
☆31 Updated 4 years ago
harvard-acc / EdgeBERT
HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference
☆52 Updated last year
natu4u / GSOC_TensorCore
TensorCore Vector Processor for Deep Learning - Google Summer of Code Project
☆22 Updated 4 years ago
pulp-platform / Deeploy
DNN Compiler for Heterogeneous SoCs
☆52 Updated this week
AMDResearch / Riallto
The Riallto Open Source Project from AMD
☆84 Updated 6 months ago
pulp-platform / quantlib
A library to train and deploy quantised Deep Neural Networks
☆25 Updated 10 months ago
soDLA-publishment / somnia
Learn NVDLA by SOMNIA
☆43 Updated 5 years ago
zhehaoxu / ai-talk
Some insights on deep learning algorithms, frameworks, compilers, and accelerators
☆16 Updated 3 years ago
IntelLabs / FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
☆111 Updated 10 months ago
tum-ei-eda / utvm_staticrt_codegen
This project contains a code generator that produces static C NN inference deployment code targeting tiny micro-controllers (TinyML) as r…
☆30 Updated 4 years ago
jinhachung / tptpu-sim
A Toy-Purpose TPU Simulator
☆19 Updated last year
NaelF / BinaryCoP
Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices
☆12 Updated 4 years ago
jaewoosong / pocketnn
The official, proof-of-concept C++ implementation of PocketNN.
☆35 Updated last month
DeepWok / mase
Machine-Learning Accelerator System Exploration Tools
☆179 Updated 3 weeks ago
sefaburakokcu / quantized-yolov5
Low-precision (quantized) YOLOv5
☆44 Updated 7 months ago
ehw-fit / tf-approximate
Approximate layers - TensorFlow extension
☆26 Updated 6 months ago