benja263 / Integer-Only-Inference-for-Deep-Learning-in-Native-C
Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.
☆21 Updated 2 years ago
Related projects
Alternatives and complementary repositories for Integer-Only-Inference-for-Deep-Learning-in-Native-C
- ☆122 Updated last year
- Floating-Point Optimized On-Device Learning Library for the PULP Platform. ☆28 Updated last week
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends ☆30 Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆127 Updated 3 weeks ago
- Fork of upstream onnxruntime focused on supporting RISC-V accelerators ☆81 Updated last year
- ☆34 Updated 8 months ago
- Low Precision (quantized) YOLOv5 ☆31 Updated 9 months ago
- A library to train and deploy quantised Deep Neural Networks ☆19 Updated 8 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware. ☆100 Updated 11 months ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future. ☆11 Updated 4 months ago
- ☆30 Updated last year
- TensorCore Vector Processor for Deep Learning - Google Summer of Code Project ☆21 Updated 3 years ago
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers. ☆64 Updated this week
- A tool to deploy Deep Neural Networks on PULP-based SoCs ☆79 Updated 8 months ago
- ☆30 Updated 3 years ago
- The code for the paper: NeuralPower: Predict and deploy energy-efficient convolutional neural networks ☆21 Updated 5 years ago
- A plug-and-play lightweight tool for the inference optimization of deep neural networks ☆37 Updated 3 weeks ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices ☆39 Updated 4 years ago
- Learn NVDLA by SOMNIA ☆26 Updated 4 years ago
- Nsight Systems in Docker ☆17 Updated 11 months ago
- LCAI-TIHU SW is a software stack of the AI inference processor based on RISC-V ☆22 Updated last year
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers ☆15 Updated last year
- An 8-/16-/32-/64-bit floating point number family ☆16 Updated 2 years ago
- ☆49 Updated 2 weeks ago
- Lightweight C implementation of CNNs for Embedded Systems ☆54 Updated last year
- The Riallto Open Source Project from AMD ☆68 Updated last week
- FRAME: Fast Roofline Analytical Modeling and Estimation ☆31 Updated last year
- Adaptive floating-point based numerical format for resilient deep learning ☆14 Updated 2 years ago
- Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core. ☆49 Updated 2 months ago