TravisWThompson1 / Makefile_Example_CUDA_CPP_To_Executable
Example Makefile for CUDA and C++ source files in a standard project layout.
☆47 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for Makefile_Example_CUDA_CPP_To_Executable
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆187 · Updated this week
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/) ☆615 · Updated 3 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆558 · Updated 3 weeks ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆128 · Updated 4 years ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆176 · Updated 2 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA ☆35 · Updated 5 years ago
- Step-by-step optimization of CUDA SGEMM ☆240 · Updated 2 years ago
- Matrix Multiply-Accumulate with CUDA and WMMA (Tensor Core) ☆115 · Updated 4 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆100 · Updated last year
- Collection of benchmarks to measure basic GPU capabilities ☆265 · Updated 5 months ago
- CUDA Matrix Multiplication Optimization ☆141 · Updated 4 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆196 · Updated 2 weeks ago
- Training material for Nsight developer tools ☆129 · Updated 3 months ago
- CUDA Kernel Benchmarking Library ☆519 · Updated this week
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems ☆32 · Updated 2 months ago
- Kernel Tuner ☆287 · Updated last week
- Fast CUDA matrix multiplication from scratch ☆479 · Updated 10 months ago
- RAJA Performance Suite ☆110 · Updated last week
- High-performance, GPU-aware communication library ☆84 · Updated 3 weeks ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆415 · Updated this week
- NVIDIA tools guide ☆71 · Updated 3 months ago
- Efficient SpGEMM on GPU using CUDA and CSR ☆50 · Updated last year
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆44 · Updated last month
- Examples from Programming in Parallel with CUDA ☆108 · Updated last year
- Advanced Profiling and Analytics for AMD Hardware ☆135 · Updated this week
- Unified Collective Communication Library ☆207 · Updated last week
- Distributed View Extension for Kokkos ☆43 · Updated 2 months ago