OrcusCZ / SVMbenchmarkLinks
CUDA and OpenCL SVM training benchmark
☆16Updated 8 years ago
Alternatives and similar repositories for SVMbenchmark
Users that are interested in SVMbenchmark are comparing it to the libraries listed below
Sorting:
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆300Updated 7 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 3 years ago
- GPU-specialized parameter server for GPU machine learning.☆102Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆73Updated 9 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆137Updated 8 years ago
- A CUDA implementation of the k-means clustering algorithm☆255Updated 13 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 10 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆269Updated 2 years ago
- An analytical performance modeling tool for deep neural networks.☆92Updated 5 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆181Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- Benchmark for Co-running Single Applications on Integrated Architectures☆12Updated 9 years ago
- Bridge to connect nGraph with TensorFlow☆52Updated 3 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆171Updated 8 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆184Updated 3 years ago
- Original Python version of Intel® Nervana™ Graph☆214Updated 3 years ago
- CuSha is a CUDA-based vertex-centric graph processing framework that uses G-Shards and CW representations.☆53Updated 10 years ago
- Reference workloads for modern deep learning methods.☆73Updated 3 years ago
- An extensible framework for program autotuning☆426Updated last week
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 8 years ago
- Repository for SysML19 Artifacts Evaluation☆54Updated 6 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 9 years ago
- Symbolic Expression and Statement Module for new DSLs☆205Updated 5 years ago
- communication-efficient distributed coordinate ascent☆90Updated 6 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆705Updated 7 years ago
- GPU implementation of Winograd convolution☆10Updated 8 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆174Updated 3 weeks ago