fengChenHPC / kmeans_cuda
A high-performance implementation of the k-means algorithm with CUDA
☆18 Updated 10 years ago
Alternatives and similar repositories for kmeans_cuda
Users who are interested in kmeans_cuda are comparing it to the libraries listed below.
- A CUDA implementation of the PageRank Pipeline Benchmark ☆32 Updated 8 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- Efficient LDA solution on GPUs. ☆24 Updated 6 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor ☆20 Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD) ☆71 Updated 7 years ago
- parallel algorithm based on cuda ☆60 Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU ☆41 Updated 6 years ago
- A Distributed Multi-GPU System for Fast Graph Processing ☆65 Updated 6 years ago
- Dolphin - a Deep Learning on MIC architecture Project. ☆25 Updated 10 years ago
- Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015 ☆19 Updated 9 years ago
- GraphMat graph analytics framework ☆102 Updated 2 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆22 Updated 7 years ago
- GPU Optimization and Memory Abstraction Framework ☆32 Updated 5 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index ☆55 Updated 10 years ago
- CuSha is a CUDA-based vertex-centric graph processing framework that uses G-Shards and CW representations. ☆52 Updated 9 years ago
- This is a tuned sparse matrix dense vector multiplication (SpMV) library ☆21 Updated 9 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial) ☆17 Updated 6 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files. ☆18 Updated 6 years ago
- Medusa: Building GPU-based Parallel Sparse Graph Applications with Sequential C/C++ Code ☆61 Updated 4 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni… ☆27 Updated 13 years ago
- ☆15 Updated 7 years ago
- Training a Tensorflow graph in C++ ☆25 Updated 8 years ago
- High Dimensional Approximate Near(est) Neighbor ☆33 Updated 7 years ago
- GPU Automatically Tuned Linear Algebra Software ☆28 Updated 9 years ago
- CNNs in Halide ☆23 Updated 9 years ago
- image to column ☆30 Updated 10 years ago
- Sparse matrix computation library for GPU ☆56 Updated 4 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM ☆70 Updated 8 years ago