implementation of winograd minimal convolution algorithm on Intel Architecture
☆39Dec 4, 2017Updated 8 years ago
Alternatives and similar repositories for Winconv
Users that are interested in Winconv are comparing it to the libraries listed below
Sorting:
- ☆26Dec 1, 2016Updated 9 years ago
- A Winograd based kernel for convolutions in deep learning framework☆15Jul 22, 2017Updated 8 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 7 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆627Feb 9, 2026Updated 2 weeks ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- Fast CUDA Kernels for ResNet Inference.☆182May 26, 2019Updated 6 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆23Aug 21, 2020Updated 5 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆193May 7, 2019Updated 6 years ago
- ☆32Aug 24, 2022Updated 3 years ago
- Retrieves the top 10 documents from the Wikipedia corpus for a user inputted free-text query☆10Nov 24, 2020Updated 5 years ago
- ☆13Jan 28, 2026Updated last month
- HPA2021 solution (3rd place)☆10Oct 13, 2021Updated 4 years ago
- ☆46Jun 19, 2024Updated last year
- YOLOV3-Tiny TensorRT6.0 13个类别☆32Apr 9, 2020Updated 5 years ago
- ☆10Mar 18, 2020Updated 5 years ago
- Библиотека работы с датчиками (АЦП) HX711 для Arduino☆13Apr 15, 2024Updated last year
- Artifact for 'Register Optimizations for Stencils on GPUs'☆10Sep 18, 2018Updated 7 years ago
- Directed masked autoencoders☆14Feb 20, 2026Updated last week
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆91Nov 23, 2022Updated 3 years ago
- AMD Software Development Kit 2.5 Sources☆10Feb 29, 2016Updated 10 years ago
- Github老玩家自己搭的服务器,老飞飞原版,可联机-天马座☆11May 14, 2019Updated 6 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 5 years ago
- Makes GNOME's topbar's background gradient.☆11Feb 7, 2026Updated 3 weeks ago
- Privacy-Preserving Multiple Tensor Factorization for Synthesizing Large-Scale Location Traces☆14Sep 14, 2021Updated 4 years ago
- Manipulate tensors with PackedSequence and CattedSequence☆12Jan 4, 2026Updated last month
- A polyphonic music transcription Vamp plugin☆10Nov 20, 2019Updated 6 years ago
- The matlab code of Sparse Contextual Activation (SCA) published in TIP 2016☆10Mar 18, 2018Updated 7 years ago
- ☆12Mar 13, 2023Updated 2 years ago
- Matlab implementation of the CS video reconstruction method RRS☆11May 21, 2018Updated 7 years ago
- C# wrapper for msnhnet.☆10Aug 14, 2020Updated 5 years ago
- GPU-accelerated AES encryption project☆11Feb 13, 2015Updated 11 years ago
- Modular, flexible, cross-platform workload profiling and characterization☆13Mar 1, 2021Updated 4 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 2 years ago
- Tutorial, examples and regression tests for Coriolis & Alliance (LIP6)☆15Jan 5, 2026Updated last month
- The VD100 development board is based on the Xilinx Versal AI Edge series chip xcve2302 and is designed with a core board and a bottom boa…☆18Jul 9, 2024Updated last year
- ☆11Aug 23, 2015Updated 10 years ago
- Tenstorrent Topology (TT-Topology) is a command line utility used to flash multiple NB cards on a system to use specific eth routing conf…☆16Feb 18, 2026Updated last week