goldsborough / k-means
Code accompanying my blog post on k-means in Python, C++ and CUDA
☆58Updated 7 years ago
Alternatives and similar repositories for k-means:
Users that are interested in k-means are comparing it to the libraries listed below
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- OpenCL Inference Engine for pytorch☆51Updated 7 years ago
- Python Binding to NVRTC☆79Updated 5 months ago
- Proximal Asynchronous SAGA☆12Updated 7 years ago
- Training neural networks with 8-bit computations☆28Updated 8 years ago
- PyTorch Framework Integration for Tensor Comprehensions☆14Updated 7 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- Code for "Aggregated Momentum: Stability Through Passive Damping", Lucas et al. 2018☆34Updated 6 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- This repository contains several tools useful for pytorch users.☆46Updated 6 years ago
- A CUDA implementation of the ZeroOut tensorflow custom op, just for fun☆11Updated 8 years ago
- Implementing Google's DistBelief paper☆109Updated 2 years ago
- Implementation of fast exact k-means algorithms☆46Updated 5 years ago
- Distributed Learning by Pair-Wise Averaging☆53Updated 7 years ago
- ☆16Updated 7 years ago
- ☆13Updated 5 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Updated 7 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Caffe version of code for our paper "Joint unsupervised learning of deep representations and image clusters"☆16Updated 7 years ago
- Symbolic differentiation engine for optimization-based machine learning models.☆42Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- An ONNX backend using PlaidML☆28Updated 6 years ago
- ☆13Updated 7 years ago
- This is the code used in the paper "Diagonal RNNs in Symbolic Music Modeling"☆17Updated 7 years ago
- SqueezeNet Generator☆31Updated 6 years ago