dglai / FeatGraph
Sparse kernels for GNNs based on TVM
☆16Updated 4 years ago
Alternatives and similar repositories for FeatGraph:
Users that are interested in FeatGraph are comparing it to the libraries listed below
- ☆13Updated 3 years ago
- ☆18Updated this week
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization☆30Updated 5 months ago
- ☆22Updated 2 years ago
- ☆14Updated 2 years ago
- ☆25Updated 3 years ago
- ☆14Updated 3 years ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆15Updated 4 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Updated last month
- This repo is to collect the state-of-the-art GNN hardware acceleration paper☆54Updated 3 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆19Updated last year
- ☆14Updated 2 years ago
- Repo for the IISWC 2018 submission☆9Updated 3 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 2 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39Updated 3 years ago
- agile hardware-software co-design☆46Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆50Updated 11 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago
- EQueue Dialect☆40Updated 3 years ago
- Artifact repository for paper Automatic Generation of High-Performance Quantized Machine Learning Kernels☆17Updated 4 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago
- ☆33Updated 3 years ago
- ☆71Updated 3 years ago
- Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators☆19Updated last year
- A reference implementation of the Mind Mappings Framework.☆29Updated 3 years ago
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆30Updated 2 years ago