HyTruongSon / GraphFlowLinks
Deep Learning framework in C++/CUDA that supports symbolic/automatic differentiation, dynamic computation graphs, tensor/matrix operations accelerated by GPU and implementations of various state-of-the-art graph neural networks and other Machine Learning models including Covariant Compositional Networks For Learning Graphs [Risi et al]
☆53Updated 4 years ago
Alternatives and similar repositories for GraphFlow
Users that are interested in GraphFlow are comparing it to the libraries listed below
Sorting:
- Experiments with Message Passing GNNs in C++ and PyTorch.☆26Updated last year
- Light-weight GPU kernel interface for graph operations☆15Updated 5 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- ☆70Updated 2 years ago
- Distributed NMF/NTF Library☆51Updated last year
- Computations involving Lie groups and harmonic analysis☆206Updated 11 months ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆73Updated 3 years ago
- ☆23Updated 6 years ago
- A fast implementation of spectral clustering on GPU-CPU Platform☆32Updated 7 years ago
- Introduction to CUDA programming☆129Updated 8 years ago
- Graph Convolutional Networks in JAX☆33Updated 5 years ago
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting☆22Updated 3 years ago
- A Generic Tensor-Network library that is designed for quantum simulation, base on the pytorch☆59Updated 6 years ago
- BayesGrad: Explaining Predictions of Graph Convolutional Networks☆63Updated 4 years ago
- IPC: A Graph Data Set Compiled from International Planning Competitions☆47Updated 6 years ago
- Parallel Tensor Infrastructure (ParTI!)☆33Updated 5 years ago
- High-Performance Tensor Transpose library☆205Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆158Updated 2 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays☆213Updated 7 months ago
- This repository is the summary of all of our works for the XLA.☆11Updated 8 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- Code for Solving Black-Box Optimization Challenge via Learning Search Space Partition for Local Bayesian Optimization.☆21Updated 4 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch☆85Updated 4 years ago
- [ICLR 2019] Learning Representations of Sets through Optimized Permutations☆36Updated 6 years ago
- CUDA implementation of the Blocked Floyd Warshall All pairs shortest path graph algorithm☆42Updated 7 years ago
- Fast Fast Hadamard Transform☆89Updated 4 years ago
- Implements the frequent directions algorithm for approximating matrices in streams☆33Updated 8 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆50Updated 7 years ago
- matrix multiplication in CUDA☆125Updated 2 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago