wichtounet / cifar-10
Simple C++ reader for CIFAR-10 dataset
☆17Updated last year
Related projects: ⓘ
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆10Updated last year
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆26Updated 4 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 4 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.☆17Updated 5 years ago
- A Winograd Minimal Filter Implementation in CUDA☆20Updated 3 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆18Updated 3 years ago
- ☆36Updated 5 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆19Updated 5 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Updated 6 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆13Updated 3 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆26Updated 2 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 3 years ago
- ResNet Implementation, Training, and Inference Using LibTorch C++ API☆34Updated 3 months ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆75Updated 2 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Updated 5 years ago
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow☆14Updated 5 years ago
- Approximate layers - TensorFlow extension☆25Updated 4 months ago
- All about acceleration and compression of Deep Neural Networks☆33Updated 4 years ago
- ☆29Updated 3 years ago
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution☆16Updated last year
- A Hackable Quantization Library for PyTorch☆18Updated 3 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆39Updated 5 years ago
- 2D Convolution using NumPy☆17Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated last year
- Implementation of the Winograd algorithm.☆20Updated 5 years ago
- Visualize TVM Relay program graph☆12Updated 4 years ago
- ☆18Updated this week
- An external memory allocator example for PyTorch.☆13Updated 2 years ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50☆13Updated 3 years ago