sifakis / CS639S23_DemosLinks
Software artifacts and Demos for CS639 (Spring 2023) "Parallel and Throughput-Optimized Programming"
☆18Updated 2 years ago
Alternatives and similar repositories for CS639S23_Demos
Users that are interested in CS639S23_Demos are comparing it to the libraries listed below
Sorting:
- Competitive GPU kernel optimization platform.☆93Updated this week
- CUDA implementation of parallel Depth First Search (DFS) algorithm and it's comparison with a serial C++ DFS implementation.☆29Updated 7 years ago
- Introduction to CUDA programming and debugging☆15Updated 2 years ago
- Exercises from the Fall 2023 Algolab course at ETH Zürich☆21Updated 7 months ago
- An implementation of parallel exclusive scan in CUDA☆62Updated 7 years ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- A set of hands-on tutorials for CUDA programming☆230Updated last year
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago
- A Visual Studio Code extension for building and debugging CUDA applications.☆87Updated 2 weeks ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆91Updated last year
- Sparsity support for PyTorch☆36Updated 4 months ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆53Updated this week
- By introducing a differentiable contact model, DiffCoSim extends the applicability of Lagrangian/Hamiltonian-inspired neural networks to …☆36Updated 2 years ago
- NVIDIA tools guide☆144Updated 7 months ago
- Neural network from scratch in CUDA/C++☆83Updated 6 months ago
- Direct solver for sparse SPD matrices for nonlinear optimization. Implements supernodal Cholesky decomposition algorithm, and supports GP…☆91Updated 2 months ago
- ☆47Updated 7 months ago
- CUDA Guide☆72Updated last year
- A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources☆99Updated 2 years ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU☆30Updated 4 months ago
- A generic, composable multi-dimensional array library.☆12Updated 2 weeks ago
- ☆32Updated last year
- Learn OpenMP examples step by step☆95Updated 6 months ago
- Code and data for paper "(How) do Language Models Track State?"☆16Updated 4 months ago
- ☆66Updated last week
- General Matrix Multiplication using NVIDIA Tensor Cores☆18Updated 6 months ago
- CME 213 Spring 2021☆65Updated 4 years ago
- Udacity CS344 Introduction to Parallell Programming (https://classroom.udacity.com/courses/cs344), with assignments/materials updated to …☆46Updated 3 years ago
- ☆27Updated last year
- Personal solutions to the Triton Puzzles☆19Updated last year