HyTruongSon / GraphFlow
A deep learning framework in C++/CUDA that supports symbolic/automatic differentiation, dynamic computation graphs, GPU-accelerated tensor/matrix operations, and implementations of various state-of-the-art graph neural networks and other machine learning models, including Covariant Compositional Networks For Learning Graphs [Risi et al]
☆53 Updated 4 years ago
Alternatives and similar repositories for GraphFlow
Users who are interested in GraphFlow compare it to the libraries listed below:
- ☆23 Updated 5 years ago
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting ☆22 Updated 3 years ago
- Light-weight GPU kernel interface for graph operations ☆15 Updated 5 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS. ☆50 Updated 7 years ago
- Experiments with Message Passing GNNs in C++ and PyTorch. ☆26 Updated last year
- Introduction to CUDA programming ☆126 Updated 8 years ago
- Computations involving Lie groups and harmonic analysis ☆205 Updated 8 months ago
- Deriving Neural Architectures from Sequence and Graph Kernels ☆59 Updated 7 years ago
- Distributed NMF/NTF Library ☆48 Updated 10 months ago
- The Surprisingly ParalleL spArse Tensor Toolkit. ☆72 Updated 3 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent ☆33 Updated 9 years ago
- Structured matrices for compressing neural networks ☆67 Updated 2 years ago
- Parallel Tensor Infrastructure (ParTI!) ☆30 Updated 5 years ago
- Slides and notebooks for the IfI Summer School 2018 on Machine Learning ☆36 Updated 7 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆156 Updated 2 years ago
- ☆70 Updated 2 years ago
- Automatically insert nvtx ranges to PyTorch models ☆19 Updated 4 years ago
- Efficient Unitary Neural Network (EUNN) implementation in TensorFlow ☆73 Updated 6 years ago
- ☆93 Updated 8 years ago
- Implement distributed machine learning with PyTorch + OpenMPI ☆52 Updated 6 years ago
- A general purpose library for numerical calculations with higher order tensors, Tensor-Train Decompositions / Matrix Product States and o… ☆20 Updated 3 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD) ☆71 Updated 7 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 Updated 9 years ago
- Tensor Train decomposition on TensorFlow ☆226 Updated 4 years ago
- The pMMF Multiresolution Matrix Factorization Library ☆27 Updated 7 years ago
- Matrix multiplication in CUDA ☆123 Updated 2 years ago
- A fast implementation of spectral clustering on GPU-CPU Platform ☆32 Updated 7 years ago
- ☆25 Updated 7 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Updated 8 years ago
- IPC: A Graph Data Set Compiled from International Planning Competitions ☆46 Updated 5 years ago