nvidia-china-sae / WholeGraph
☆11 Updated 4 years ago
Alternatives and similar repositories for WholeGraph:
Users who are interested in WholeGraph are comparing it to the libraries listed below:
- ☆71 Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61 Updated 2 years ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆44 Updated last year
- WholeGraph - large scale Graph Neural Networks☆102 Updated 4 months ago
- An Attention Superoptimizer☆21 Updated 2 months ago
- Light-weight GPU kernel interface for graph operations☆15 Updated 4 years ago
- [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node …☆56 Updated last year
- Set of datasets for the deep learning recommendation model (DLRM).☆43 Updated 2 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39 Updated 3 years ago
- ☆46 Updated 2 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆51 Updated 2 years ago
- ☆16 Updated 2 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41 Updated 3 years ago
- ☆26 Updated 3 years ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆109 Updated last year
- ☆45 Updated 2 weeks ago
- LazyGCN☆9 Updated 4 years ago
- ☆106 Updated 3 years ago
- ☆38 Updated last year
- Largest real-world open-source graph dataset - Work done under IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards a…☆79 Updated 3 months ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12 Updated last year
- Distributed DataLoader For Pytorch Based On Ray☆24 Updated 3 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆40 Updated last year
- Memory footprint reduction for transformer models☆11 Updated 2 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56 Updated 3 years ago
- A Python library that transfers PyTorch tensors between CPU and NVMe☆111 Updated 4 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62 Updated last year
- ☆22 Updated last year
- The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…☆39 Updated last year
- ☆43 Updated last year