ntuhpc / training-ay1819Links
sample code/text used in NTU HPC Internal Training during AY2018-2019
☆24Updated 6 years ago
Alternatives and similar repositories for training-ay1819
Users that are interested in training-ay1819 are comparing it to the libraries listed below
Sorting:
- Seminar on selected tools in Computer Science☆25Updated 4 years ago
- ISC Student Cluster Competition 2019 -- AI challenge☆10Updated 6 years ago
- An efficient concurrent graph processing system☆46Updated 4 years ago
- IMPACT GPU Algorithms Teaching Labs☆58Updated 2 years ago
- Learn OpenMP examples step by step☆97Updated 9 months ago
- ☆31Updated 3 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆112Updated 2 years ago
- CUDA C++ syntax support & snippets for VSCode☆20Updated 4 years ago
- cuASR: CUDA Algebra for Semirings☆39Updated 3 years ago
- Online CUDA Occupancy Calculator☆80Updated 4 years ago
- TLB Benchmarks☆34Updated 8 years ago
- Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training☆24Updated last year
- A hybrid partitioner based quantum circuit simulation system on GPU☆48Updated 3 years ago
- matrix multiplication in CUDA☆123Updated 2 years ago
- My paper/code reading notes in Chinese☆46Updated 4 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆210Updated this week
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆31Updated 3 years ago
- Rodinia benchmark☆189Updated 2 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆59Updated 3 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loops…☆96Updated 2 years ago
- ☆29Updated 5 years ago
- Chai☆45Updated last year
- Galois: C++ library for multi-core and multi-node parallelization☆342Updated last year
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆77Updated 3 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- Introduction to CUDA programming☆128Updated 8 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆122Updated 3 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 7 months ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs☆15Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆132Updated this week