quettabit/convolution_kernel

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/quettabit/convolution_kernel)

quettabit / convolution_kernel

Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.

☆14

Alternatives and similar repositories for convolution_kernel

Users that are interested in convolution_kernel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
CSshengxy / MEC
View on GitHub
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)
☆17Apr 9, 2019Updated 7 years ago
ksopyla / CudaDotProd
View on GitHub
Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for mu…
☆17Nov 15, 2010Updated 15 years ago
WalkerLau / GPU-CNN
View on GitHub
Accelerate convolution neural network for face recognition using GPU
☆15Nov 24, 2020Updated 5 years ago
Xilinx / SDFEC-PYNQ
View on GitHub
A PYNQ overlay demonstrating the Xilinx RFSoC SD-FEC
☆13Jun 29, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
piojanu / CUDA-im2col-conv
View on GitHub
CUDA project for uni subject
☆26Oct 26, 2020Updated 5 years ago
UDC-GAC / openCNN
View on GitHub
A Winograd Minimal Filter Implementation in CUDA
☆31Aug 25, 2021Updated 4 years ago
matiaslindgren / cuda-memory-access-recorder
View on GitHub
Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser
☆13Nov 17, 2020Updated 5 years ago
NaoyukiIchimura / cuda_image_filtering_global
View on GitHub
☆11Dec 5, 2018Updated 7 years ago
ROCm / AITemplate
View on GitHub
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆12Jun 24, 2024Updated 2 years ago
streamer-AP / HRT19D-detection
View on GitHub
无人车感知组的技术文章，教程
☆18Jan 17, 2019Updated 7 years ago
wutianze / HydraMini
View on GitHub
Autonomous Driving Research and Educational Platform
☆15Dec 22, 2021Updated 4 years ago
nelson-liu / website
View on GitHub
☆13Feb 5, 2022Updated 4 years ago
zhanglei1949 / federatedSpeechCommands
View on GitHub
Speech recognition with federated learning
☆11Jan 9, 2020Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
xuqiantong / CUDA-Winograd
View on GitHub
Fast CUDA Kernels for ResNet Inference.
☆183May 26, 2019Updated 7 years ago
So-Cool / xCave
View on GitHub
Google Earth Pro image extractor and alignment
☆13Feb 9, 2018Updated 8 years ago
erf / polygon-overlap
View on GitHub
Check if two polygons overlap
☆10Dec 19, 2015Updated 10 years ago
RIKEN-RCCS / hpl-ai
View on GitHub
An HPL-AI implementation for Fugaku
☆24Jun 29, 2021Updated 5 years ago
scottlinlin / auto_feature_demo
View on GitHub
☆14Jul 15, 2018Updated 8 years ago
pat-coady / contrast-pred-code
View on GitHub
Minimal implementation of Contrastive Predictive Coding for audio.
☆18Nov 17, 2019Updated 6 years ago
patflick / miopen-benchmark
View on GitHub
benchmarking miopen
☆17Jan 14, 2019Updated 7 years ago
innerlee / nvidia-smi
View on GitHub
Vscode extension -- show GPU activities on status bar
☆14Jun 25, 2019Updated 7 years ago
louisliuwei / pynq-dpu
View on GitHub
Migrate Xilinx edge AI solution to PYNQ
☆17Nov 3, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
toshipiazza / drcfg
View on GitHub
Dynamic Control Flow Recovery
☆25Apr 15, 2018Updated 8 years ago
JackBurdick / ASR_DL
View on GitHub
☆13Feb 5, 2018Updated 8 years ago
1202kbs / MemN2N-Tensorflow
View on GitHub
Implementation of End-To-End Memory Networks with Tensorflow for bAbI Dataset
☆11Aug 17, 2017Updated 8 years ago
DeMoriarty / custom_matmul_kernels
View on GitHub
Customized matrix multiplication kernels
☆57Mar 5, 2022Updated 4 years ago
wali-ku / BWLOCK-GPU
View on GitHub
Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms
☆12Apr 9, 2018Updated 8 years ago
archiki / ASR-Accent-Analysis
View on GitHub
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15Jun 27, 2020Updated 6 years ago
wgq18 / img_defog
View on GitHub
一种基于FPGA平台的实时视频去雾系统项目代码，其中bit流文件可以直接下载到PYNQ-Z2开发板上，通过usb和hdmi设备输入有雾视频，将去雾后的视频输出到显示屏上。c++源代码部分是我们的去雾IP核的源代码。
☆20Nov 24, 2019Updated 6 years ago
MingSun-Tse / Smile-Pruning
View on GitHub
A generic code base for neural network pruning, especially for pruning at initialization.
☆32Sep 3, 2022Updated 3 years ago
marsiau / PYNQ-RTL-SDR
View on GitHub
A FPGA accelerated SDR receiver using PYNQ-Z2 board and RTL-SDR
☆23Oct 22, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rusiaaman / PCPM
View on GitHub
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
☆23Dec 27, 2019Updated 6 years ago
jlpteaching / ECS201A
View on GitHub
Materials for ECS 201A
☆11Oct 23, 2019Updated 6 years ago
xupsh / pynq-supported-board-file
View on GitHub
☆24Nov 30, 2018Updated 7 years ago
dengzelu / semantic-segmentation-pytorch
View on GitHub
semantic segmentation using pytorch
☆11Dec 1, 2017Updated 8 years ago
LucienXian / octave_run_BFM_docker
View on GitHub
A app for the BFM data generation
☆12Apr 23, 2019Updated 7 years ago
iDoka / hdl-secded-producer
View on GitHub
MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.
☆12Dec 27, 2022Updated 3 years ago
SongBaiHust / Sparse-Contextual-Activation
View on GitHub
The matlab code of Sparse Contextual Activation (SCA) published in TIP 2016
☆10Mar 18, 2018Updated 8 years ago