TanDongXu / CUDA-MCDNN
☆12Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for CUDA-MCDNN
- ☆24Updated 6 years ago
- Optimizing Mobile Deep Learning on ARM GPU with TVM☆179Updated 6 years ago
- A way to use cuda to accelerate top k algorithm☆29Updated 7 years ago
- ☆45Updated 2 years ago
- Deep Learning/GPU Architect/Autonomous Driving Positions☆80Updated 4 years ago
- ☆127Updated 6 years ago
- CNN accelerated by cuda. Test on mnist and finilly get 99.76%☆184Updated 7 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆30Updated 6 years ago
- Caffe with NNPACK integration☆59Updated 8 years ago
- tutorial to optimize GEMM performance on android☆51Updated 8 years ago
- Binary Weight Network and XNOR Network.☆63Updated 8 years ago
- TensorFlow and TVM integration☆38Updated 4 years ago
- Benchmark of TVM quantized model on CUDA☆112Updated 4 years ago
- Tensorflow Serving with support for Caffe☆41Updated 7 years ago
- Caffe for Sparse Convolutional Neural Network☆238Updated last year
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆54Updated last year
- C++ interface for mxnet☆114Updated 7 years ago
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame…☆72Updated 6 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- Tensorflow to TensorRT Model Converter☆30Updated 6 years ago
- Caffe Computation Graph Optimization.☆29Updated 4 years ago
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- ☆25Updated 6 years ago
- This is a CNN Analyzer tool, based on Netscope by dgschwend/netscope☆40Updated 6 years ago
- mobilenet-mxnet☆145Updated 6 years ago
- Hopefully fast implementation of XNOR-Net in C, because, why not?☆26Updated 7 years ago