suco-gt / HPC-Internships
Supercomputing @ GT has compiled a list of organizations that offer internships and experiences in HPC and applications of HPC.
☆56 · Updated last year
Alternatives and similar repositories for HPC-Internships:
Users who are interested in HPC-Internships are comparing it to the repositories listed below:
- A curated list of awesome high performance computing resources ☆745 · Updated 2 weeks ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆206 · Updated last month
- NVIDIA tools guide ☆96 · Updated 2 weeks ago
- collection of benchmarks to measure basic GPU capabilities ☆282 · Updated 2 weeks ago
- Fast CUDA matrix multiplication from scratch ☆580 · Updated last year
- Example Makefile for CUDA and C++ source files in a standard project layout. ☆48 · Updated 7 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆45 · Updated 3 months ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆130 · Updated 4 years ago
- Rodinia benchmark ☆15 · Updated 6 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite ☆63 · Updated 6 years ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading. ☆121 · Updated 2 years ago
- Step-by-step optimization of CUDA SGEMM ☆272 · Updated 2 years ago
- Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli… ☆24 · Updated 7 months ago
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/) ☆659 · Updated 5 months ago
- ☆109 · Updated 3 months ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems ☆33 · Updated last month
- Rodinia benchmark ☆169 · Updated last year
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA ☆37 · Updated 5 years ago
- ☆114 · Updated 5 months ago
- IMPACT GPU Algorithms Teaching Labs ☆56 · Updated last year
- Advanced Matrix Extensions (AMX) Guide ☆78 · Updated 3 years ago
- CUDA Matrix Multiplication Optimization ☆153 · Updated 6 months ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆188 · Updated 2 years ago
- 📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software ☆13 · Updated last month
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆197 · Updated last month
- 🎃 GPU load-balancing library for regular and irregular computations. ☆58 · Updated 7 months ago
- N-Ways to Multi-GPU Programming ☆15 · Updated last year
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark ☆75 · Updated 10 months ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y… ☆38 · Updated 7 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆48 · Updated this week