chaolongzhang / algorithms-cuda

parallel algorithm based on cuda

☆60

Alternatives and similar repositories for algorithms-cuda

Users who are interested in algorithms-cuda are comparing it to the libraries listed below.

zhxfl / CUDA-CNN
CNN accelerated by CUDA. Tested on MNIST and finally gets 99.76%
☆186 Updated 7 years ago
xuqiantong / CUDA-Winograd
Fast CUDA Kernels for ResNet Inference.
☆177 Updated 6 years ago
boxvc / NVIDIA-Jobs
Deep Learning/GPU Architect/Autonomous Driving Positions
☆80 Updated 5 years ago
seetaresearch / dragon
A Computation Graph Virtual Machine based ML Framework
☆108 Updated last year
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47 Updated 8 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17 Updated 5 years ago
yuxianzhi / Top-K
A way to use CUDA to accelerate the top-k algorithm
☆29 Updated 8 years ago
yanqswhu / cuda_by_example
The CMake version of cuda_by_example
☆148 Updated 4 years ago
vinx13 / tvm-cuda-int8-benchmark
Benchmark of TVM quantized model on CUDA
☆111 Updated 5 years ago
CSshengxy / MEC
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)
☆17 Updated 6 years ago
mz24cn / clnet
OpenCL for Nets - A Deep Learning Framework based on OpenCL, written in C++. Supports popular MLP, RNN(LSTM), CNN(ResNet). Friendly debug…
☆68 Updated 6 years ago
cwlacewe / netscope
This is a CNN Analyzer tool, based on Netscope by dgschwend/netscope
☆42 Updated 7 years ago
xingyul / sparse-winograd-cnn
Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)
☆191 Updated 6 years ago
OrangeOwlSolutions / General-CUDA-programming
☆44 Updated 7 years ago
matazure / mtensor
A C++/CUDA template library for tensor lazy evaluation
☆161 Updated 2 years ago
ZihaoZhao / CUDA_study
☆45 Updated 5 years ago
Caffe-MPI / Caffe-MPI.github.io
☆125 Updated 7 years ago
wykvictor / cs344-cuda-udacity
Windows Visual Studio Solutions for class "Introduction to Parallel Programming"
☆19 Updated 6 years ago
bcaine / nn_cpp
A minimalistic header-only C++11 Neural Network library based on Eigen::Tensor
☆20 Updated 7 years ago
ravi-teja-mullapudi / Halide-NN
CNNs in Halide
☆23 Updated 9 years ago
zhiqi-0 / RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
☆14 Updated 7 years ago
zhaoweicai / hwgq
Caffe implementation of accurate low-precision neural networks
☆117 Updated 6 years ago
xylcbd / EasyCNN
Easy convolutional neural network
☆165 Updated 3 years ago
LitLeo / OpenCUDA
☆263 Updated 7 years ago
keithyin / read-pytorch-source-code
Reading the PyTorch source code, version 0.2.0
☆90 Updated 5 years ago
IntelLabs / SkimCaffe
Caffe for Sparse Convolutional Neural Network
☆238 Updated 2 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41 Updated 6 years ago
fengbingchun / CUDA_Test
Usage of CUDA/SIMD/AssemblyLanguage/OpenMP/Eigen
☆105 Updated 2 years ago
merrymercy / tvm-mali
Optimizing Mobile Deep Learning on ARM GPU with TVM
☆181 Updated 6 years ago
XiuYuLi / deepcore_source_code
Subpart source code of deepcore v0.7
☆27 Updated 5 years ago