leonardopsantos / jetsonTX2Power
☆16Updated 6 years ago
Alternatives and similar repositories for jetsonTX2Power:
Users that are interested in jetsonTX2Power are comparing it to the libraries listed below
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.☆54Updated 7 years ago
- ☆36Updated 7 years ago
- ☆39Updated 7 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 6 years ago
- ☆36Updated 6 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 5 years ago
- Benchmarking Analysis of Vision Kernels on Embedded CPU, GPU and FPGA☆15Updated 5 years ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆80Updated 2 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 10 months ago
- ☆29Updated 3 years ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆15Updated 4 years ago
- Winograd-based convolution implementation in OpenCL☆28Updated 8 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Updated 5 years ago
- ☆27Updated 4 years ago
- Experiments evaluating preemption on the NVIDIA Pascal architecture☆17Updated 8 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆81Updated last month
- HLS branch of Halide☆77Updated 6 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆9Updated 7 months ago
- A Hackable Quantization Library for PyTorch☆20Updated 4 years ago
- Example code and instructions on getting Tensorflow Lite running on a Xilinx Zynq☆49Updated 7 years ago
- ☆14Updated 5 years ago
- ☆40Updated 5 years ago
- Jetson embedded platform-target deep learning inference acceleration framework with TensorRT☆27Updated this week
- ☆70Updated 5 years ago
- Simulator for BitFusion☆97Updated 4 years ago
- YOLO object detector for Movidius Neural Compute Stick (NCS)☆52Updated 6 years ago