sjperkins / tfopgen
Generate C++ and CUDA boilerplate for tensorflow custom operators
☆20Updated 6 years ago
Related projects: ⓘ
- NNVM for ROCm Examples☆19Updated 6 years ago
- ☆36Updated 5 years ago
- Distributed Learning by Pair-Wise Averaging☆53Updated 6 years ago
- Python Binding to NVRTC☆79Updated 6 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 8 years ago
- TensorFlow util for building memory usage timeline from LOG_MEMORY messages☆65Updated 6 years ago
- ☆28Updated 6 years ago
- Python bindings for pyNVML and psutil library over network☆50Updated 9 months ago
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆43Updated 7 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated last year
- easy embeddable Torch7 networks☆35Updated 8 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 7 years ago
- ☆35Updated 7 years ago
- train on AWS☆75Updated 6 years ago
- PyTorch development for onnx☆21Updated 6 years ago
- A rudimentary wrapper around the fast Maxwell kernels for GEMM and convolution operations provided by nervanagpu☆34Updated 9 years ago
- ☆58Updated 8 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 7 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆111Updated 7 years ago
- MPI Parallel framework for training deep learning models built in Theano☆53Updated 7 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- Training neural networks with 8-bit computations☆29Updated 8 years ago
- Implementation and demonstration of backdrop in pytorch. Code and demonstration of GP dataset generator.☆68Updated 6 years ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆39Updated 6 years ago
- Lightweight interface to AWS☆47Updated 4 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆51Updated 10 years ago
- ONNX Integration Builds☆20Updated 6 years ago