al42and / cuda-smi
Simple utility to show nVidia GPU memory usage wrt. CUDA device IDs.
☆38Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for cuda-smi
- Code examples for CUDA and OpenACC☆34Updated 3 months ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 7 years ago
- Python Binding to NVRTC☆79Updated last month
- Microway's improved version of GPU Burn☆86Updated 3 months ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support☆91Updated 9 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 7 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆51Updated 10 years ago
- A very simple camera interface (frame grabber) for Torch7.☆34Updated 7 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 8 years ago
- Speeding up and debittering Caffe by adding Halide☆18Updated 9 years ago
- Scientific library for high-precision computations and research☆50Updated 7 years ago
- Workshop on the future of gradient-based machine learning software, NIPS 2017, 2016☆15Updated 6 years ago
- Boost.Python interface for NumPy; now deprecated in factor of the version in Boost.Python itself.☆151Updated 6 years ago
- TensorFlow util for building memory usage timeline from LOG_MEMORY messages☆65Updated 6 years ago
- CuPy Benchmark☆12Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Fast autodiff.☆19Updated 9 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆29Updated 8 years ago
- easy embeddable Torch7 networks☆35Updated 8 years ago
- Torch is a scientific computing framework with wide support for machine learning algorithms. It is easy to use and efficient, thanks to a…☆38Updated 2 years ago
- FluidNet re-written with ATen tensor lib☆51Updated 5 years ago
- miniplaces2 deep residual network in neon☆16Updated 8 years ago
- a fully-differentiable graphical raytracer☆15Updated 9 years ago
- expresso☆44Updated 8 years ago
- Standalone C TH library☆57Updated 7 years ago