HyTruongSon / GraphFlowLinks
Deep Learning framework in C++/CUDA that supports symbolic/automatic differentiation, dynamic computation graphs, tensor/matrix operations accelerated by GPU and implementations of various state-of-the-art graph neural networks and other Machine Learning models including Covariant Compositional Networks For Learning Graphs [Risi et al]
☆53Updated 4 years ago
Alternatives and similar repositories for GraphFlow
Users that are interested in GraphFlow are comparing it to the libraries listed below
Sorting:
- ☆23Updated 6 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆73Updated 3 years ago
- Introduction to CUDA programming☆129Updated 8 years ago
- Experiments with Message Passing GNNs in C++ and PyTorch.☆26Updated last year
- A fast implementation of spectral clustering on GPU-CPU Platform☆32Updated 7 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 5 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆50Updated 7 years ago
- matrix multiplication in CUDA☆123Updated 2 years ago
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting☆22Updated 3 years ago
- ☆70Updated 2 years ago
- Computations involving Lie groups and harmonic analysis☆205Updated 9 months ago
- Sparse matrix computation library for GPU☆59Updated 5 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays☆209Updated 5 months ago
- ☆94Updated 8 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 7 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post:☆12Updated 9 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆156Updated 2 years ago
- High-Performance Tensor Transpose library☆204Updated 2 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- 📝 "End-to-end Deep Learning of Optimization Heuristics" (🥇 PACT'17 Best Paper)☆72Updated 2 years ago
- Tensor Train decomposition on TensorFlow☆227Updated 4 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆52Updated 6 years ago
- Distributed NMF/NTF Library☆51Updated 11 months ago
- Structured matrices for compressing neural networks☆67Updated 2 years ago
- Deriving Neural Architectures from Sequence and Graph Kernels☆59Updated 8 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆72Updated 9 years ago
- kmeans clustering with multi-GPU capabilities☆119Updated 2 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 7 years ago
- BayesGrad: Explaining Predictions of Graph Convolutional Networks☆63Updated 3 years ago