CMUAbstract / Graph-Reordering-IISWC18
Repo for the IISWC 2018 submission
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Graph-Reordering-IISWC18
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆63Updated last year
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆60Updated 2 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆37Updated 8 months ago
- Distributed Multi-GPU GNN Framework☆36Updated 4 years ago
- [HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design☆32Updated 2 years ago
- ☆10Updated 3 years ago
- A Factored System for Sample-based GNN Training over GPUs☆42Updated last year
- This repo is to collect the state-of-the-art GNN hardware acceleration paper☆54Updated 3 years ago
- ☆26Updated 5 months ago
- ☆44Updated 2 years ago
- Source code for the graph reordering technique, DBG, published in [Faldu et al., IISWC'19].☆7Updated 5 years ago
- SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.☆48Updated last year
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆35Updated 4 months ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆27Updated 2 years ago
- FGNN's artifact evaluation (EuroSys 2022)☆17Updated 2 years ago
- A Framework for Graph Sampling and Random Walk on GPUs.☆38Updated 2 years ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.☆45Updated last year
- Graph Sampling using GPU☆51Updated 2 years ago
- ☆9Updated 2 years ago
- ☆18Updated 4 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization☆25Updated 9 months ago
- ☆101Updated 3 years ago
- FlashMob is a shared-memory random walk system.☆31Updated last year
- ☆42Updated last week
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆30Updated last year
- An efficient storage system for concurrent graph processing☆10Updated 3 years ago
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆28Updated 2 years ago
- Transforming Graphs for Efficient Irregular Graph Processing on GPUs☆47Updated 2 years ago
- ☆10Updated last year