HyTruongSon / GraphFlow
Deep learning framework in C++/CUDA that supports symbolic/automatic differentiation, dynamic computation graphs, GPU-accelerated tensor/matrix operations, and implementations of various state-of-the-art graph neural networks and other machine learning models, including Covariant Compositional Networks For Learning Graphs [Risi et al]
☆54 Updated 3 years ago
Alternatives and similar repositories for GraphFlow
Users who are interested in GraphFlow are comparing it to the libraries listed below.
- Experiments with Message Passing GNNs in C++ and PyTorch. ☆26 Updated 11 months ago
- The Surprisingly ParalleL spArse Tensor Toolkit. ☆71 Updated 3 years ago
- Distributed NMF/NTF Library ☆46 Updated 7 months ago
- ☆70 Updated 2 years ago
- Introduction to CUDA programming ☆122 Updated 8 years ago
- A fast implementation of spectral clustering on GPU-CPU Platform ☆32 Updated 7 years ago
- Implementing Google's DistBelief paper ☆110 Updated 2 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD) ☆71 Updated 7 years ago
- PyTorch-MPI-DDP-example ☆17 Updated 7 years ago
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting ☆21 Updated 3 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆155 Updated 2 years ago
- A Distributed Multi-GPU System for Fast Graph Processing ☆65 Updated 6 years ago
- Numerically Solving Parametric Families of High-Dimensional Kolmogorov Partial Differential Equations via Deep Learning (NeurIPS 2020) ☆22 Updated 2 years ago
- Structured matrices for compressing neural networks ☆67 Updated last year
- CUDA-accelerated minimum spanning tree algorithm -- data parallel Boruvka's algorithm ☆19 Updated 9 years ago
- The pMMF Multiresolution Matrix Factorization Library ☆27 Updated 7 years ago
- Computations involving Lie groups and harmonic analysis ☆203 Updated 5 months ago
- Notebooks for IPAM Tutorial, March 15 2019 ☆24 Updated 6 years ago
- kmeans clustering with multi-GPU capabilities ☆119 Updated 2 years ago
- ☆23 Updated 5 years ago
- matrix multiplication in CUDA ☆123 Updated last year
- GPU Accelerated Subsampled Newton Method for Convex Optimization ☆8 Updated 7 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent ☆33 Updated 8 years ago
- PyTorch Extension Library of Optimized Unique Operation ☆37 Updated 6 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU ☆41 Updated 6 years ago
- Efficient LDA solution on GPUs. ☆24 Updated 6 years ago
- Code for Solving Black-Box Optimization Challenge via Learning Search Space Partition for Local Bayesian Optimization. ☆21 Updated 3 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch ☆85 Updated 3 years ago
- ☆51 Updated 11 months ago
- IPC: A Graph Data Set Compiled from International Planning Competitions ☆46 Updated 5 years ago