nvidia-china-sae / WholeGraph
☆11 Updated 4 years ago
Alternatives and similar repositories for WholeGraph:
Users who are interested in WholeGraph are comparing it to the libraries listed below.
- ☆71 Updated 3 years ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun… ☆44 Updated last year
- Light-weight GPU kernel interface for graph operations ☆15 Updated 4 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef… ☆61 Updated 2 years ago
- An Attention Superoptimizer ☆21 Updated 3 months ago
- ☆38 Updated last year
- Set of datasets for the deep learning recommendation model (DLRM). ☆45 Updated 2 years ago
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N… ☆30 Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆64 Updated last year
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021 ☆56 Updated 3 years ago
- WholeGraph - large scale Graph Neural Networks ☆105 Updated 5 months ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations" ☆39 Updated 3 years ago
- LazyGCN ☆9 Updated 4 years ago
- ICLR 2021 ☆48 Updated 4 years ago
- ☆45 Updated 2 weeks ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention ☆35 Updated last week
- ☆106 Updated 3 years ago
- ☆22 Updated last year
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022 ☆30 Updated 7 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark. ☆38 Updated last year
- Inference framework for MoE layers based on TensorRT with Python binding ☆41 Updated 3 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆40 Updated last year
- ☆24 Updated 2 years ago
- Distributed Deep Graph Learning Framework for Dynamic Graphs ☆12 Updated last year
- ☆16 Updated 2 years ago
- [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node … ☆56 Updated last year
- Largest real-world open-source graph dataset - Work done under IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards a… ☆82 Updated 4 months ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆135 Updated 2 years ago
- GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding ☆114 Updated 2 years ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, … ☆110 Updated last year