chenxuhao/caffe-escoin

Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs

☆16

Alternatives and similar repositories for caffe-escoin

Users that are interested in caffe-escoin are comparing it to the libraries listed below.

YusukeNagasaka / Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16 · May 7, 2019 · Updated 7 years ago
marsupialtail / gpu-sparsert
☆18 · Oct 15, 2020 · Updated 5 years ago
shamanDevel / cuMat
An expression template based linear algebra library running completely on the GPU using CUDA
☆26 · Jun 24, 2021 · Updated 5 years ago
maltanar / spmv-vector-cache
A Vector Caching Scheme for Streaming FPGA SpMV Accelerators
☆10 · Sep 7, 2015 · Updated 10 years ago
codyjrivera / tsm2x-imp
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆35 · Jul 28, 2020 · Updated 5 years ago
eth-cscs / spla
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…
☆32 · Jun 26, 2024 · Updated 2 years ago
GPUPeople / ACSpGEMM
Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
☆31 · Jul 7, 2020 · Updated 6 years ago
chenxuhao / gardenia
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
☆34 · Apr 3, 2022 · Updated 4 years ago
pkestene / ppkMHD
MPI+Kokkos implementation of spectral difference method (SDM) high order schemes
☆30 · Feb 2, 2025 · Updated last year
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆74 · Oct 5, 2020 · Updated 5 years ago
hclhkbu / gcoospdm
Sparse-dense matrix-matrix multiplication on GPUs
☆14 · Oct 15, 2018 · Updated 7 years ago
lixiuhong / batched_gemm
☆40 · Feb 28, 2020 · Updated 6 years ago
poojahira / spmv-cuda
Implementation and analysis of five different GPU based SPMV algorithms in CUDA
☆39 · Feb 5, 2019 · Updated 7 years ago
Bruce-Lee-LY / cuda_back2back_hgemm
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
☆13 · Nov 3, 2023 · Updated 2 years ago
apuaaChen / vectorSparse
☆32 · Aug 24, 2022 · Updated 3 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17 · Dec 2, 2019 · Updated 6 years ago
danghvu / cudaSpmv
CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format
☆22 · Jun 8, 2018 · Updated 8 years ago
libocca / occa.py
OCCA Python API: JIT Compilation for Multiple Architectures
☆11 · Dec 20, 2019 · Updated 6 years ago
XiuYuLi / deepcore_source_code
Subpart source code of deepcore v0.7
☆27 · Jun 28, 2020 · Updated 6 years ago
GPUPeople / spECK
Efficient SpGEMM on GPU using CUDA and CSR
☆61 · Jul 18, 2023 · Updated 3 years ago
EkdeepSLubana / flowandprune
Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"
☆20 · Jan 31, 2021 · Updated 5 years ago
slongle / GPU-Renderer
Offline renderer using CUDA
☆13 · Jun 8, 2020 · Updated 6 years ago
matiaslindgren / cuda-memory-access-recorder
Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser
☆13 · Nov 17, 2020 · Updated 5 years ago
hgyhungry / ge-spmm
☆115 · Jul 3, 2021 · Updated 5 years ago
md2z34 / winograd_gpu
GPU implementation of Winograd convolution
☆10 · Oct 23, 2017 · Updated 8 years ago
josehu07 / cuckoo-hashing-CUDA
Parallel cuckoo hashing on GPUs with CUDA
☆12 · Sep 27, 2019 · Updated 6 years ago
pigirons / spmv
This is a tuned sparse matrix dense vector multiplication (SpMV) library
☆23 · Mar 21, 2016 · Updated 10 years ago
PolyArch / stream-dataflow
Public Release of Stream-Dataflow
☆14 · May 17, 2019 · Updated 7 years ago
flame / tblis-strassen
Strassen's Algorithm for Tensor Contraction
☆15 · Jul 7, 2017 · Updated 9 years ago
nullplay / Unified-Convolution-Framework
☆10 · Apr 24, 2023 · Updated 3 years ago
baidu-research / catamount
Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …
☆14 · May 18, 2021 · Updated 5 years ago
YashasSamaga / ConvolutionBuildingBlocks
GEMM and Winograd based convolutions using CUTLASS
☆28 · Jul 15, 2020 · Updated 6 years ago
j-levy / bwa-gasal2
BWA-MEM program accelerated with the GASAL2 library
☆19 · Sep 2, 2019 · Updated 6 years ago
Liu-Cheng / graph_accelerator
Graph accelerator on FPGAs and ASICs
☆11 · Aug 16, 2018 · Updated 7 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48 · May 22, 2024 · Updated 2 years ago
amallia / gpu-integers-compression
GPU-Accelerated Faster Decoding of Integer Lists
☆13 · Aug 20, 2019 · Updated 6 years ago
ishanhan / parallel-implementation-of-kmeans
Parallel implementation of k-means clustering using MPI4PY and PyCUDA.
☆10 · Mar 11, 2019 · Updated 7 years ago
budlbaram / tiny_imagenet
model learning and test for tiny-imageNet
☆25 · Oct 19, 2017 · Updated 8 years ago
temporal-hpc / reduction-tensor-cores
Fast GPU based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago