rapidsai / wholegraphLinks

WholeGraph - large scale Graph Neural Networks

☆104

Alternatives and similar repositories for wholegraph

Users that are interested in wholegraph are comparing it to the libraries listed below

Sorting:

amazon-science / FeatGraph
☆70Updated 4 years ago
hpcaitech / TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe
☆116Updated 7 months ago
K-Wu / pytorch-direct_dgl
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)
☆44Updated 2 years ago
facebookresearch / dlrm_datasets
Set of datasets for the deep learning recommendation model (DLRM).
☆47Updated 2 years ago
quiver-team / torch-quiver
PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.
☆302Updated last year
hgyhungry / ge-spmm
☆109Updated 4 years ago
marius-team / marius
Large scale graph learning on a single machine.
☆162Updated 4 months ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆60Updated 2 years ago
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated this week
MITIBMxGraph / SALIENT
The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…
☆40Updated 2 years ago
awslabs / slapo
A schedule language for large model training
☆149Updated last year
intel / intel-extension-for-deepspeed
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆61Updated 2 weeks ago
xxcclong / GNN-Computing
Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"
☆39Updated 3 years ago
NVIDIA-Merlin / HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆156Updated 2 weeks ago
NVIDIA / compute-sanitizer-samples
Samples demonstrating how to use the Compute Sanitizer Tools and Public API
☆84Updated last year
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆147Updated last week
alibaba / graphlearn-for-pytorch
A GPU-accelerated graph learning library for PyTorch, facilitating the scaling of GNN training and inference.
☆138Updated 3 months ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆270Updated 4 years ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆138Updated 2 years ago
NVIDIA-Merlin / distributed-embeddings
distributed-embeddings is a library for building large embedding based models in Tensorflow 2.
☆44Updated last year
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Updated last year
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆343Updated this week
Azure / msccl
Microsoft Collective Communication Library
☆64Updated 7 months ago
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆385Updated this week
AlibabaResearch / flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
☆216Updated last year
dgSPARSE / dgSPARSE-Lib
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
☆111Updated 2 weeks ago
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆108Updated last year
awslabs / lorien
☆43Updated last year
microsoft / SuperScaler
An experimental parallel training platform
☆54Updated last year
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago