sgiraz/CUDA-Training

Some CUDA projects and utility

☆27

Alternatives and similar repositories for CUDA-Training

Users who are interested in CUDA-Training are comparing it to the repositories listed below.

dawn-chu / EECS-368-Programming-Massively-Parallel-Processors-with-CUDA
☆19 · May 17, 2016 · Updated 10 years ago
bsc-quantic / EinExprs.jl
Einsum Expressions in Julia
☆14 · Aug 2, 2025 · Updated 11 months ago
TensorBFS / CuTropicalGEMM.jl
The fastest Tropical number matrix multiplication on GPU
☆10 · Aug 23, 2025 · Updated 10 months ago
GiggleLiu / YaoTutorial
A tutorial for Yao.jl
☆11 · Oct 9, 2023 · Updated 2 years ago
GiggleLiu / ScientificComputingForPhysicists
A scientific computing book for physicists, with Julia programming language
☆19 · Jan 29, 2025 · Updated last year
GiggleLiu / ProblemReductions.jl
Reduction between computational hard problems.
☆13 · Nov 24, 2025 · Updated 7 months ago
CFDML / KitAMR.jl
A massively parallel distributed computational fluid dynamics facility with adaptive mesh refinement.
☆22 · Jun 28, 2026 · Updated 3 weeks ago
marcosamaris / gpuperfpredict
Predict Performance of GPU Applications using analytical model and Machine Learning
☆11 · Aug 31, 2022 · Updated 3 years ago
stecrotti / BeliefPropagation.jl
The Belief Propagation approximation for probability distributions on sparse graphs
☆26 · Jun 23, 2026 · Updated 3 weeks ago
xuanzhaogao / TreeWidthSolver.jl
Implementation of the tree width algorithms.
☆19 · Nov 24, 2025 · Updated 7 months ago
TensorBFS / TensorInference.jl
Probabilistic inference using contraction of tensor networks
☆25 · Sep 9, 2025 · Updated 10 months ago
nvixnu / pmpp__programming_massively_parallel_processors
Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…
☆79 · Jan 21, 2021 · Updated 5 years ago
GiggleLiu / cryochamber
Cryochamber for your AI agents, for scheduling long running tasks
☆29 · Jul 8, 2026 · Updated last week
GiggleLiu / LuxorGraphPlot.jl
A minimum Luxor backended graph visualization package.
☆10 · Aug 15, 2024 · Updated last year
msussman42 / amrex_implicit_interfaces
☆11 · Updated this week
GiggleLiu / ScientificComputingDemos
Demos for book: Scientific computing for physicists.
☆35 · Nov 6, 2025 · Updated 8 months ago
jerhoud / TensorMixedStates.jl
A julia library to simulate quantum mixed states and Lindblad equation using matrix product states
☆27 · Updated this week
sandialabs / CSPlib
Computational singular perturbation analysis library
☆12 · Sep 11, 2025 · Updated 10 months ago
nzy1997 / qec-thrust
☆11 · Apr 16, 2026 · Updated 3 months ago
KarhouTam / cuda-kernels
Some common CUDA kernel implementations (Not the fastest).
☆30 · Jun 24, 2026 · Updated 3 weeks ago
olcf-tutorials / local_mpi_to_gpu
How to use node-local MPI rank IDs to manually map MPI ranks to GPUs
☆14 · Apr 22, 2020 · Updated 6 years ago
marlam / gencolormap
color map generator for scientific visualization
☆12 · Nov 12, 2025 · Updated 8 months ago
optimisan / llvm-mips-backend
Tutorial for writing an LLVM backend
☆33 · May 19, 2025 · Updated last year
Jutho / CuTensorOperations.jl
TensorOperations and cuTENSOR combined
☆13 · Nov 26, 2019 · Updated 6 years ago
AMReX-Combustion / PeleAnalysis
A collection of processing tools for reacting flow simulations with AMReX-based CFD tools. See https://peleanalysis.readthedocs.io/en/la…
☆15 · Jul 1, 2026 · Updated 2 weeks ago
blueCFD / PyFoam
Porting PyFoam to Windows. Please read the file "README.Windows".
☆14 · Jan 18, 2015 · Updated 11 years ago
AlgebraicJulia / CliqueTrees.jl
A Julia library for computing tree decompositions and chordal completions of graphs.
☆37 · Jun 30, 2026 · Updated 3 weeks ago
luohancfd / py2tec
python to tecplot
☆11 · Oct 12, 2020 · Updated 5 years ago
GiggleLiu / ModernScientificComputing
Course AMAT5315: Advanced scientific computing, the website and Julia notebooks
☆36 · Dec 26, 2023 · Updated 2 years ago
HAWAIILAB / cuda-flux
CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels
☆33 · Mar 15, 2021 · Updated 5 years ago
jonaslindemann / guide_to_python
Source code for the book "Ingenjörens guide till Python"
☆17 · Apr 19, 2026 · Updated 3 months ago
tpapp / LogDensityProblemsAD.jl
AD backends for LogDensityProblems.jl.
☆13 · Jul 1, 2026 · Updated 2 weeks ago
AlgebraicJulia / StructuredDecompositions.jl
Structured decompositions!
☆15 · Mar 26, 2025 · Updated last year
MartinMikkelsen / FewBodyPhysics.jl
Quantum mechanical few body systems in Julia
☆11 · Jan 6, 2026 · Updated 6 months ago
CoffeeBeforeArch / spring_2020_tutorial
"Hardware, Software, and Compilers! Oh My!" tutorial files
☆16 · Jan 25, 2020 · Updated 6 years ago
Mog9 / FlashAttention-CuPy
Flash Attention from scratch, tiled CUDA forward kernel, online softmax with running max and correction factor, recomputation trick in ba…
☆18 · Mar 6, 2026 · Updated 4 months ago
davidrpugh / tensorflow-gpu-data-science-project
Template repository for a Python 3-based (data) science project with GPU acceleration using the TensorFlow ecosystem.
☆12 · Nov 28, 2021 · Updated 4 years ago
UNITES-Lab / Occult
[ICML'25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13 · Apr 17, 2025 · Updated last year
andreyklots / SuperQuantPackage
Modeling and Analysis of Superconducting Quantum Circuits
☆13 · Feb 22, 2021 · Updated 5 years ago