dian-lun-lin / SNIGLinks
SNIG: Accelerated Large Sparse Neural Network Inference using Task Graph Parallelism
ā34Updated 4 years ago
Alternatives and similar repositories for SNIG
Users that are interested in SNIG are comparing it to the libraries listed below
Sorting:
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inferenceā19Updated 2 years ago
- š GPU load-balancing library for regular and irregular computations.ā66Updated 4 months ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communicationā29Updated 2 years ago
- Heterogeneous Programmingā17Updated 2 years ago
- ā40Updated 3 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018ā73Updated 5 years ago
- ā40Updated 5 years ago
- ā50Updated 6 years ago
- development repository for the open earth compilerā82Updated 4 years ago
- PTX-EMU is a simple emulator for CUDA program.ā38Updated 9 months ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Correctionsā124Updated 3 years ago
- MLIR Sample dialectā137Updated last month
- An extension library of WMMA API (Tensor Core API)ā109Updated last year
- Dissecting NVIDIA GPU Architectureā117Updated 3 years ago
- ā11Updated 2 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.ā91Updated 3 years ago
- A lightweight memory allocator for hardware-accelerated machine learningā180Updated 4 months ago
- Multilevel Directed Acyclic Graph Partitionerā35Updated 3 years ago
- ā31Updated 3 years ago
- ā68Updated 6 years ago
- Conversions to MLIR EmitCā134Updated last year
- TLB Benchmarksā35Updated 8 years ago
- Evaluating different memory managers for dynamic GPU memoryā26Updated 5 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019ā58Updated 3 years ago
- CSR-based SpGEMM on nVidia and AMD GPUsā46Updated 9 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Acceleratorsā121Updated 3 years ago
- ngAP's artifact for ASPLOS'24ā25Updated 6 months ago
- Concurrent CPU-GPU Programming using Task Modelsā106Updated 6 years ago
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel ā¦ā192Updated last year
- A home for the final text of all TVM RFCs.ā109Updated last year