csehydrogen / Winograd-OpenCL
Winograd-based convolution implementation in OpenCL
☆27Updated 7 years ago
Related projects: ⓘ
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Updated 5 years ago
- ☆39Updated 3 years ago
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago
- tophub autotvm log collections☆70Updated last year
- Fast CUDA Kernels for ResNet Inference.☆164Updated 5 years ago
- ☆17Updated 4 years ago
- implementation of winograd minimal convolution algorithm on Intel Architecture☆37Updated 6 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆19Updated 4 years ago
- ☆26Updated 7 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆63Updated 6 years ago
- Aiming at an AI Chip based on RISC-V and NVDLA.☆21Updated 6 years ago
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.☆52Updated 7 years ago
- CNNs in Halide☆22Updated 8 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆76Updated 4 years ago
- HLS branch of Halide☆76Updated 6 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆190Updated 5 years ago
- ☆41Updated 4 years ago
- ☆30Updated last year
- ☆76Updated this week
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆175Updated 2 years ago
- ☆20Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆20Updated 3 years ago
- ☆35Updated this week
- MAFIA: Multiple Application Framework for GPU architectures☆24Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆106Updated 3 months ago
- A Winograd based kernel for convolutions in deep learning framework☆15Updated 7 years ago
- Implementation of convolution layer in different flavors☆68Updated 6 years ago
- OpenCL Labs for PAPAA Summer School 2016 Edition☆46Updated 7 years ago
- flexible-gemm conv of deepcore☆17Updated 4 years ago