goldsborough / k-means
Code accompanying my blog post on k-means in Python, C++ and CUDA
☆58 · Updated 7 years ago
Related projects
Alternatives and complementary repositories for k-means
- Training neural networks with 8-bit computations ☆29 · Updated 8 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100… ☆39 · Updated 6 years ago
- Slides and code for my talk at MeetingC++ 2017 ☆48 · Updated 7 years ago
- Direct C++ Interface to PyTorch ☆80 · Updated 6 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++. ☆56 · Updated 5 years ago
- Caffe version of code for our paper "Joint unsupervised learning of deep representations and image clusters" ☆16 · Updated 7 years ago
- Benzina is an image-loader package that greatly accelerates image loading onto GPUs using their built-in hardware codecs. ☆35 · Updated last year
- PyProf2: PyTorch Profiling tool ☆83 · Updated 4 years ago
- Deep neural network framework (C/C++/CUDA). ☆31 · Updated 9 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN. ☆102 · Updated 6 years ago
- Symbolic differentiation engine for optimization-based machine learning models. ☆42 · Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni… ☆27 · Updated 13 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs. ☆27 · Updated 5 years ago
- Training deep neural networks with low precision multiplications ☆63 · Updated 9 years ago
- Python Binding to NVRTC ☆79 · Updated last month
- Deep learning with a multiplication budget ☆47 · Updated 6 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it. ☆17 · Updated 6 years ago
- Implementation of fast exact k-means algorithms ☆46 · Updated 5 years ago
- TensorFlow util for building memory usage timeline from LOG_MEMORY messages ☆65 · Updated 6 years ago
- PyTorch Framework Integration for Tensor Comprehensions ☆14 · Updated 6 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU ☆40 · Updated 6 years ago
- Distributed Learning by Pair-Wise Averaging ☆53 · Updated 7 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files. ☆17 · Updated 5 years ago
- A GPU / CPU implementation of a feed forward neural network ☆33 · Updated 9 years ago
- ☆47 · Updated 4 years ago
- A very simple variant of adversarial training that yields excellent results on MNIST ☆12 · Updated 8 years ago
- Demo codes in our presentation about MXNet in AWS Seoul Summit 2017 ☆13 · Updated 7 years ago
- This repository contains several tools useful for pytorch users. ☆46 · Updated 5 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆63 · Updated 5 years ago
- Codebase associated with the PyTorch compiler tutorial ☆44 · Updated 5 years ago