Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.
☆14 · Dec 8, 2017 · Updated 8 years ago
Alternatives and similar repositories for convolution_kernel
Users that are interested in convolution_kernel are comparing it to the libraries listed below
- GPU implementation of Winograd convolution ☆10 · Oct 23, 2017 · Updated 8 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial) ☆17 · Apr 9, 2019 · Updated 6 years ago
- Different implementations of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for mu… ☆17 · Nov 15, 2010 · Updated 15 years ago
- Accelerate convolutional neural network for face recognition using GPU ☆13 · Nov 24, 2020 · Updated 5 years ago
- A PYNQ overlay demonstrating the Xilinx RFSoC SD-FEC ☆13 · Jun 29, 2022 · Updated 3 years ago
- CUDA project for uni subject ☆26 · Oct 26, 2020 · Updated 5 years ago
- A Winograd Minimal Filter Implementation in CUDA ☆28 · Aug 25, 2021 · Updated 4 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser ☆13 · Nov 17, 2020 · Updated 5 years ago
- ☆11 · Dec 5, 2018 · Updated 7 years ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆12 · Jun 24, 2024 · Updated last year
- Autonomous Driving Research and Educational Platform ☆15 · Dec 22, 2021 · Updated 4 years ago
- Technical articles and tutorials from the autonomous vehicle perception group ☆18 · Jan 17, 2019 · Updated 7 years ago
- Speech recognition with federated learning ☆11 · Jan 9, 2020 · Updated 6 years ago
- image to column ☆30 · Jul 15, 2014 · Updated 11 years ago
- Google Earth Pro image extractor and alignment ☆13 · Feb 9, 2018 · Updated 8 years ago
- Fast CUDA Kernels for ResNet Inference. ☆182 · May 26, 2019 · Updated 6 years ago
- Check if two polygons overlap ☆10 · Dec 19, 2015 · Updated 10 years ago
- ☆14 · Jul 15, 2018 · Updated 7 years ago
- benchmarking miopen ☆17 · Jan 14, 2019 · Updated 7 years ago
- Minimal implementation of Contrastive Predictive Coding for audio. ☆17 · Nov 17, 2019 · Updated 6 years ago
- Vscode extension -- show GPU activities on status bar ☆14 · Jun 25, 2019 · Updated 6 years ago
- ☆13 · Feb 5, 2022 · Updated 4 years ago
- Dynamic Control Flow Recovery ☆25 · Apr 15, 2018 · Updated 7 years ago
- Migrate Xilinx edge AI solution to PYNQ ☆17 · Nov 3, 2020 · Updated 5 years ago
- ☆14 · Feb 5, 2018 · Updated 8 years ago
- An HPL-AI implementation for Fugaku ☆23 · Jun 29, 2021 · Updated 4 years ago
- Implementation of End-To-End Memory Networks with Tensorflow for bAbI Dataset ☆11 · Aug 17, 2017 · Updated 8 years ago
- Customized matrix multiplication kernels ☆57 · Mar 5, 2022 · Updated 4 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆15 · Jun 27, 2020 · Updated 5 years ago
- An FPGA-accelerated SDR receiver using PYNQ-Z2 board and RTL-SDR ☆22 · Oct 22, 2019 · Updated 6 years ago
- A generic code base for neural network pruning, especially for pruning at initialization. ☆31 · Sep 3, 2022 · Updated 3 years ago
- Project code for an FPGA-based real-time video dehazing system. The bitstream file can be downloaded directly to a PYNQ-Z2 development board; hazy video is input through USB and HDMI devices, and the dehazed video is output to a display. The C++ source code is the source of our dehazing IP core. ☆20 · Nov 24, 2019 · Updated 6 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice. ☆23 · Dec 27, 2019 · Updated 6 years ago
- Materials for ECS 201A ☆11 · Oct 23, 2019 · Updated 6 years ago
- ☆23 · Nov 30, 2018 · Updated 7 years ago
- An app for BFM data generation ☆12 · Apr 23, 2019 · Updated 6 years ago
- Securing Data Analytics on Intel SGX using Randomization ☆13 · Aug 30, 2017 · Updated 8 years ago
- The matlab code of Sparse Contextual Activation (SCA) published in TIP 2016 ☆10 · Mar 18, 2018 · Updated 8 years ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL. ☆12 · Dec 27, 2022 · Updated 3 years ago