nvidia-china-sae / WholeGraph
☆11Updated 3 years ago
Alternatives and similar repositories for WholeGraph:
Users that are interested in WholeGraph are comparing it to the libraries listed below
- ☆73Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- WholeGraph - large scale Graph Neural Networks☆101Updated 2 months ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆45Updated last year
- G3: A Programmable GNN Training System on GPU☆43Updated 4 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…☆38Updated last year
- An Attention Superoptimizer☆21Updated 3 weeks ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 3 years ago
- ☆38Updated last year
- A Python library transfers PyTorch tensors between CPU and NVMe☆103Updated 2 months ago
- ☆104Updated 3 years ago
- ☆46Updated 2 years ago
- Largest realworld open-source graph dataset - Worked done under IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards a…☆79Updated 2 months ago
- ☆36Updated 8 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆107Updated last year
- [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node …☆53Updated last year
- ☆44Updated last year
- LazyGCN☆9Updated 4 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆50Updated 2 years ago
- ☆23Updated 2 months ago
- GPTQ inference TVM kernel☆38Updated 9 months ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆38Updated 11 months ago
- ICLR 2021☆46Updated 3 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆134Updated last year
- PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.☆296Updated last year
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated 11 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆64Updated 2 years ago
- My paper/code reading notes in Chinese☆46Updated 8 months ago