pwlnk / cuda-neural-networkLinks

Simple neural network implementation using CUDA technology. It is an educational implementation.

☆97

Alternatives and similar repositories for cuda-neural-network

Users that are interested in cuda-neural-network are comparing it to the libraries listed below

Sorting:

lzhengchun / matrix-cuda
matrix multiplication in CUDA
☆123Updated last year
leimao / CUDA-GEMM-Optimization
CUDA Matrix Multiplication Optimization
☆214Updated last year
wzsh / wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
☆138Updated 4 years ago
cwpearson / nvidia-performance-tools
Instructions, Docker images, and examples for Nsight Compute and Nsight Systems
☆131Updated 5 years ago
deeperlearning / professional-cuda-c-programming
☆453Updated 10 years ago
eegkno / CUDA_by_practice
CUDA by practice
☆129Updated 5 years ago
CoffeeBeforeArch / cuda_programming
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
☆852Updated 2 years ago
CodedK / CUDA-by-Example-source-code-for-the-book-s-examples-
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …
☆433Updated 2 years ago
NVIDIA / nsight-training
Training material for Nsight developer tools
☆163Updated last year
wangzyon / NVIDIA_SGEMM_PRACTICE
Step-by-step optimization of CUDA SGEMM
☆363Updated 3 years ago
Hardware-Alchemy / cuDNN-sample
cuDNN sample codes provided by Nvidia
☆46Updated 6 years ago
andreinechaev / nvcc4jupyter
A plugin for Jupyter Notebook to run CUDA C/C++ code
☆238Updated 10 months ago
BobMcDear / neural-network-cuda
Neural network from scratch in CUDA/C++
☆83Updated 6 months ago
olcf / cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
☆831Updated 11 months ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆270Updated 4 years ago
siboehm / SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
☆786Updated last year
jaredhoberock / stanford-cs193g-sp2010
This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010
☆220Updated 3 years ago
R100001 / Programming-Massively-Parallel-Processors
☆173Updated last year
PacktPublishing / Learn-CUDA-Programming
Learn CUDA Programming, published by Packt
☆1,173Updated last year
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆768Updated 5 months ago
yzhaiustc / Optimizing-SGEMM-on-NVIDIA-Turing-GPUs
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
☆370Updated 7 months ago
RichardAns / CUDA-Programs
Examples from Programming in Parallel with CUDA
☆158Updated 2 years ago
Huanghongru / SGEMM-Implementation-and-Optimization
Some source code about matrix multiplication implementation on CUDA
☆34Updated 6 years ago
leimao / CUTLASS-Examples
CUTLASS and CuTe Examples
☆68Updated 3 weeks ago
puttsk / cuda-tutorial
A set of hands-on tutorials for CUDA programming
☆230Updated last year
Cjkkkk / CUDA_gemm
A simple high performance CUDA GEMM implementation.
☆392Updated last year
zchee / cuda-sample
CUDA official sample codes
☆372Updated 9 years ago
paramhanji / CUDA-CNN
Implementation of a simple CNN using CUDA
☆68Updated 8 years ago
mark-poscablo / gpu-sum-reduction
CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.
☆37Updated 8 years ago
CisMine / Guide-NVIDIA-Tools
NVIDIA tools guide
☆144Updated 7 months ago