csehydrogen/Winograd-OpenCL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/csehydrogen/Winograd-OpenCL)

csehydrogen / Winograd-OpenCL

Winograd-based convolution implementation in OpenCL

☆29

Alternatives and similar repositories for Winograd-OpenCL

Users that are interested in Winograd-OpenCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jdnie / Winograd_study
View on GitHub
理解winograd算法原理
☆10Apr 26, 2020Updated 6 years ago
md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
dorthyluu / cs194-winograd
View on GitHub
☆25Dec 1, 2016Updated 9 years ago
xuqiantong / CUDA-Winograd
View on GitHub
Fast CUDA Kernels for ResNet Inference.
☆183May 26, 2019Updated 7 years ago
TaoHuUMD / Winograd_Convolution
View on GitHub
A Winograd based kernel for convolutions in deep learning framework
☆15Jul 22, 2017Updated 9 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ROCm / DirectGMA_CL
View on GitHub
Simple example showing how to use DGMA in OpenCL
☆13Feb 11, 2016Updated 10 years ago
CSshengxy / MEC
View on GitHub
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)
☆17Apr 9, 2019Updated 7 years ago
SC-Tech-Program / Author-Kit
View on GitHub
Instructions and templates for SC authors
☆17Aug 22, 2021Updated 4 years ago
Orion34-lanbo / tvm-batch-matmul-example
View on GitHub
☆24Mar 22, 2018Updated 8 years ago
nihui / ncnn-vulkan-compute-sample
View on GitHub
☆13Mar 29, 2025Updated last year
Leonardo-Ding / gpu_sgemm
View on GitHub
☆17Jul 1, 2020Updated 6 years ago
postmalloc / skeletonide
View on GitHub
Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…
☆14Oct 21, 2020Updated 5 years ago
Minhchuyentoancbn / Continual-Learning
View on GitHub
All in One - Continual Learning
☆11May 24, 2023Updated 3 years ago
nachiket / papaa-opencl
View on GitHub
OpenCL Labs for PAPAA Summer School 2016 Edition
☆46Jul 24, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pierpaolomori / SemanticSegmentationFPGA
View on GitHub
☆11Sep 3, 2022Updated 3 years ago
dgobbi / AIRS
View on GitHub
Atamai Image Registration and Segmentation
☆22Apr 1, 2026Updated 3 months ago
ucb-bar / fpga-spartan6
View on GitHub
Support for zScale on Spartan6 FPGAs
☆15Aug 3, 2015Updated 10 years ago
xiangze / CNN_FPGA
View on GitHub
verilog CNN generator for FPGA
☆34Jan 4, 2021Updated 5 years ago
RoySegal / tvmcon23_byoc
View on GitHub
☆11Mar 15, 2023Updated 3 years ago
ColfaxResearch / FALCON
View on GitHub
Library for fast image convolution in neural networks on Intel Architecture
☆30Jun 25, 2017Updated 9 years ago
dicecco1 / fpga_cpfp
View on GitHub
HLS Custom-Precision Floating-Point Library
☆13Nov 6, 2017Updated 8 years ago
vimar-gu / SSD
View on GitHub
[AAAI2024] Summarizing Stream Data for Memory-Restricted Online Continual Learning
☆22Apr 30, 2024Updated 2 years ago
boschresearch / what-matters-for-meta-learning
View on GitHub
[CVPR 2022] What Matters For Meta-Learning Vision Regression Tasks?
☆21Jun 13, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
YashasSamaga / ConvolutionBuildingBlocks
View on GitHub
GEMM and Winograd based convolutions using CUTLASS
☆28Jul 15, 2020Updated 6 years ago
DreamIP / haddoc2
View on GitHub
Caffe to VHDL
☆68Jun 17, 2020Updated 6 years ago
gengyl08 / NetFPGA-10G
View on GitHub
Yilong's NetFPGA-10G Repo
☆13May 7, 2015Updated 11 years ago
aravinds92 / Systolic-Array
View on GitHub
Systolic array based hardware for Image processing on the SPARTAN-6 FPGA
☆13May 26, 2016Updated 10 years ago
aaronshappell / tage-predictor
View on GitHub
SystemVerilog implemention of the TAGE branch predictor
☆14May 26, 2021Updated 5 years ago
rocmarchive / ROCm-Profiler
View on GitHub
ROCm Command Line Profiler - Updated moved to https://github.com/GPUOpen-Tools/RCP
☆10Aug 24, 2017Updated 8 years ago
nihui / ncnn-android-vkpeak
View on GitHub
ncnn android vkpeak
☆25May 27, 2026Updated 2 months ago
saelo / deeplearn
View on GitHub
OpenCL deep learning toolkit
☆17Jun 15, 2025Updated last year
HPCRL / ASPLOS_artifact
View on GitHub
☆13Nov 1, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
codyjrivera / tsm2x-imp
View on GitHub
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆35Jul 28, 2020Updated 6 years ago
zhiqwang / shufaCV
View on GitHub
☆26May 22, 2023Updated 3 years ago
WalkerLau / GPU-CNN
View on GitHub
Accelerate convolution neural network for face recognition using GPU
☆15Nov 24, 2020Updated 5 years ago
ashleetiw / Lane-detection-pointclouds
View on GitHub
☆23Jun 12, 2021Updated 5 years ago
hijiangtao / infovis-ucas
View on GitHub
Programming Assignment Project for Information Visualization Course on University of Chinese Academy of Sciences
☆12Mar 10, 2017Updated 9 years ago
zhangxinqian / jetsontx2-cross-compilation-using-nnvm-and-tvm
View on GitHub
nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform
☆11Apr 17, 2018Updated 8 years ago
flame / tblis-strassen
View on GitHub
Strassen's Algorithm for Tensor Contraction
☆15Jul 7, 2017Updated 9 years ago