MartinThoma / matrix-multiplicationLinks
Some scripts in Python, Java and C++ for matrix multiplication.
☆93Updated 4 years ago
Alternatives and similar repositories for matrix-multiplication
Users that are interested in matrix-multiplication are comparing it to the libraries listed below
Sorting:
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Source code from NVIDIA CUDACasts☆49Updated 11 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆32Updated 8 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 5 years ago
- different types of tutorials, such as machine learning, image processing and etc.☆102Updated 9 years ago
- Quasi Random Number Generator☆49Updated 4 years ago
- ArrayFire's Machine Learning Library.☆105Updated 6 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Graph-based learning in Python☆17Updated 7 years ago
- Symbolic differentiation engine for optimization-based machine learning models.☆43Updated 7 years ago
- pydeeplearn is a simple deep learning library written from scratch entirely in Python (CNN / RNN)☆15Updated 10 years ago
- ☆30Updated 7 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- Introduction to Parallel Programming class code☆30Updated 10 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- This is the code for "An introduction to GPU Programming with CUDA" by Siraj Raval on Youtube☆61Updated 7 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Updated 7 years ago
- Benchmarking matrix multiplication implementations☆100Updated 8 years ago
- A high performance implementation of kmeans algorithm with cuda☆18Updated 10 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 9 years ago
- StarPU Runtime system☆16Updated 14 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- A GPU / CPU implementation of a feed forward neural network☆31Updated 10 years ago
- A short paper describing the library is available on arXiv.☆64Updated 7 years ago
- A framework for index based similarity search.☆19Updated 6 years ago
- Code accompanying my blog post on k-means in Python, C++ and CUDA☆58Updated 7 years ago
- Fast matrix multiplication☆29Updated 3 years ago