feifeibear / SWCaffe
A Deep Learning Framework customized for Sunway TaihuLight
☆39 Updated 5 years ago
Related projects:
- A highly efficient library for GEMM operations on Sunway TaihuLight ☆14 Updated 4 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems ☆58 Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library ☆49 Updated 7 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM ☆69 Updated 7 years ago
- this is the release repository of superneurons ☆52 Updated 3 years ago
- flexible-gemm conv of deepcore ☆17 Updated 4 years ago
- Automated machine learning as an AI-HPC benchmark ☆63 Updated 2 years ago
- Subpart source code of deepcore v0.7 ☆27 Updated 4 years ago
- High-performance, GPU-aware communication library ☆85 Updated last month
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆76 Updated 4 years ago
- HCC Sample Applications ☆13 Updated 7 years ago
- ☆21 Updated this week
- ☆20 Updated 2 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA ☆31 Updated 4 years ago
- A simple tool to profile performance of multiple combinations of GEMM of cuBLAS ☆24 Updated 3 years ago
- Repository for SysML19 Artifacts Evaluation ☆53 Updated 5 years ago
- High performance NCCL plugin for Bagua. ☆15 Updated 3 years ago
- ☆39 Updated 3 years ago
- Symbolic Expression and Statement Module for new DSLs ☆205 Updated 3 years ago
- Dissecting NVIDIA GPU Architecture ☆78 Updated 2 years ago
- code for benchmarking GPU performance based on cublasSgemm and cublasHgemm ☆28 Updated 2 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆96 Updated 7 years ago
- A quick way to benchmark your CUDA compiler on a Linux environment ☆24 Updated 13 years ago
- CUPTI GPU Profiler ☆36 Updated 5 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm) ☆47 Updated 7 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi ☆93 Updated 3 months ago
- An experimental ahead of time compiler for Relay. ☆51 Updated 4 years ago
- GPU Performance Advisor ☆58 Updated 2 years ago
- ☆127 Updated 6 years ago