CisMine / Setup-as-Cuda-programmers

Setup Cuda

☆22

Alternatives and similar repositories for Setup-as-Cuda-programmers

Users that are interested in Setup-as-Cuda-programmers are comparing it to the libraries listed below

CisMine / Parallel-Computing-Cuda-C
CUDA Learning guide
☆395 Updated last year
CisMine / Guide-NVIDIA-Tools
NVIDIA tools guide
☆135 Updated 5 months ago
ThoenigAdrian / NeuralNetworksCudaTutorial
Implement Neural Networks in Cuda from Scratch
☆23 Updated last year
leokruglikov / CUDA-notes
Personal notes on CUDA programming
☆55 Updated 2 years ago
drkennetz / cuda_examples
Some CUDA example code with READMEs.
☆165 Updated 3 months ago
mikeroyal / CUDA-Guide
CUDA Guide
☆66 Updated last year
SzymonOzog / FastSoftmax
☆38 Updated 5 months ago
alexarmbr / matmul-playground
☆11 Updated 2 months ago
BobMcDear / neural-network-cuda
Neural network from scratch in CUDA/C++
☆80 Updated 5 months ago
RichardAns / CUDA-Programs
Examples from Programming in Parallel with CUDA
☆153 Updated 2 years ago
CisMine / Setup_dataset
Read custom dataset
☆11 Updated 2 years ago
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆95 Updated 5 months ago
stanford-cs149 / cs149gpt
☆72 Updated last year
gpu-mode / profiling-cuda-in-torch
☆159 Updated last year
Infatoshi / mnist-cuda
☆265 Updated 5 months ago
SzymonOzog / GPU_Programming
☆59 Updated this week
vdesai2014 / inference-optimization-blog-post
☆88 Updated last year
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contains a simple Llama 3 implementation in pure JAX.
☆66 Updated 4 months ago
rkaehn / gpt-2
GPT-2 in C
☆71 Updated 5 months ago
lweitkamp / GANs-JAX
Implementation of several Generative Adversarial Networks in JAX / Flax
☆34 Updated 3 years ago
tensara / tensara
Competitive GPU kernel optimization platform.
☆79 Updated last week
CisMine / GPU-in-ML-DL
Apply GPU in ML and DL
☆52 Updated 4 months ago
lessw2020 / triton_kernels_for_fun_and_profit
Custom kernels in Triton language for accelerating LLMs
☆22 Updated last year
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆99 Updated last year
loganwatchorn / notes-pmpp
Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)
☆53 Updated 10 months ago
a-hamdi / GPU
100 days of building GPU kernels!
☆445 Updated last month
jax-ml / scaling-book
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
☆399 Updated 2 weeks ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆407 Updated 4 months ago
stas00 / ml-ways
ML/DL Math and Method notes
☆61 Updated last year
EzgiKorkmaz / generalization-reinforcement-learning
A Survey Analyzing Generalization in Deep Reinforcement Learning
☆34 Updated 7 months ago