A Winograd Minimal Filter Implementation in CUDA
☆28 · Aug 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for openCNN
Users that are interested in openCNN are comparing it to the libraries listed below.
- GPU implementation of Winograd convolution ☆10 · Oct 23, 2017 · Updated 8 years ago
- ☆14 · May 28, 2019 · Updated 6 years ago
- CUDA project for uni subject ☆26 · Oct 26, 2020 · Updated 5 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns. ☆14 · Dec 8, 2017 · Updated 8 years ago
- ☆32 · Aug 24, 2022 · Updated 3 years ago
- Convolution operator optimization on GPU, including GEMM-based (implicit GEMM) convolution. ☆43 · Sep 29, 2025 · Updated 6 months ago
- Fast CUDA Kernels for ResNet Inference. ☆183 · May 26, 2019 · Updated 6 years ago
- Implementation of the paper - Fast Training of Convolutional Networks through FFTs (CUDA for parallelization) ☆10 · May 8, 2020 · Updated 5 years ago
- Implementation of the Winograd algorithm. ☆24 · Nov 6, 2018 · Updated 7 years ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs ☆29 · Nov 29, 2023 · Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library ☆54 · Aug 10, 2017 · Updated 8 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018) ☆192 · May 7, 2019 · Updated 6 years ago
- The humble incremental-search task switcher for Wox ☆20 · Mar 23, 2014 · Updated 12 years ago
- Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction". ☆14 · Feb 4, 2025 · Updated last year
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆26 · Oct 3, 2023 · Updated 2 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding ☆16 · Oct 20, 2021 · Updated 4 years ago
- Examples illustrating usage of the rocBLAS library ☆17 · Aug 12, 2024 · Updated last year
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y… ☆46 · May 22, 2024 · Updated last year
- Source code "Unsupervised Model Personalization while Preserving Privacy and Scalability: An Open Problem." @ CVPR2020 ☆12 · Dec 8, 2022 · Updated 3 years ago
- Implementation of 3d non-separable convolution using CUDA & FFT Convolution ☆20 · Jan 15, 2019 · Updated 7 years ago
- Sparsity support for PyTorch ☆38 · Mar 22, 2025 · Updated last year
- ☆113 · Jul 3, 2021 · Updated 4 years ago
- ☆63 · Jul 21, 2024 · Updated last year
- ☆19 · Nov 5, 2024 · Updated last year
- ☆11 · Dec 5, 2018 · Updated 7 years ago
- ☆21 · Apr 13, 2022 · Updated 4 years ago
- A library of GPU kernels for sparse matrix operations. ☆286 · Nov 24, 2020 · Updated 5 years ago
- ☆19 · Jul 30, 2024 · Updated last year
- Automatic Detection Of Photovoltaic Panels Through Remote Sensing ☆16 · Oct 3, 2020 · Updated 5 years ago
- A simple forward-inference framework obtained by taking MNN apart (for study!) ☆24 · Feb 21, 2021 · Updated 5 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆627 · Feb 9, 2026 · Updated 2 months ago
- ☆120 · Apr 11, 2024 · Updated 2 years ago
- 🐱 ncnn int8 model quantization evaluation ☆14 · Oct 10, 2022 · Updated 3 years ago
- ☆10 · Feb 1, 2022 · Updated 4 years ago
- ☆41 · Apr 3, 2022 · Updated 4 years ago
- image to column ☆30 · Jul 15, 2014 · Updated 11 years ago
- GEMM by WMMA (tensor core) ☆15 · Jul 31, 2022 · Updated 3 years ago
- ☆10 · Apr 24, 2023 · Updated 2 years ago
- MagmaDNN: a simple deep learning framework in c++ ☆52 · Aug 21, 2020 · Updated 5 years ago