IvanVigor / Balanced_Graph_Partitioning
Implementation of the paper "Balanced Graph Partitioning" (Konstantin Andreev and Harald Racke), by Ivan Vigorito and Lorenzo Frigerio
☆14Updated 2 years ago
Alternatives and similar repositories for Balanced_Graph_Partitioning
Users who are interested in Balanced_Graph_Partitioning compare it to the repositories listed below
- This repo collects state-of-the-art GNN hardware acceleration papers☆54Updated 3 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆28Updated 3 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39Updated 3 years ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆66Updated 2 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆40Updated last year
- [HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design☆36Updated 3 years ago
- ☆71Updated 3 years ago
- A Dataflow library for graph analytics acceleration☆14Updated 9 years ago
- ☆46Updated 2 years ago
- G3: A Programmable GNN Training System on GPU☆43Updated 4 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- ☆106Updated 3 years ago
- ☆45Updated last month
- A Framework for Graph Sampling and Random Walk on GPUs.☆39Updated 3 months ago
- Repo for the IISWC 2018 submission☆9Updated 3 years ago
- Distributed Multi-GPU GNN Framework☆37Updated 4 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- [FPGA 2020] Open sourced implementation for the ACM/SIGDA FPGA '20 paper titled "GraphACT: Accelerating GCN Training on CPU-FPGA Heteroge…☆18Updated 4 years ago
- The official code for DATE'23 paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>☆20Updated last month
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆44Updated last year
- Transforming Graphs for Efficient Irregular Graph Processing on GPUs☆47Updated 2 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆30Updated 5 months ago
- ☆12Updated 2 years ago
- Sparse kernels for GNNs based on TVM☆16Updated 4 years ago
- agile hardware-software co-design☆46Updated 3 years ago
- An efficient storage system for concurrent graph processing☆10Updated 4 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization☆30Updated last year
- ☆25Updated 5 years ago
- Workload-Aware Co-Optimization☆8Updated 2 years ago