Idein / qmklLinks
Math Kernel Library for VideoCore IV QPU
☆69Updated 7 years ago
Alternatives and similar repositories for qmkl
Users that are interested in qmkl are comparing it to the libraries listed below
Sorting:
- A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function☆88Updated 10 years ago
- Python library for GPGPU programming on Raspberry Pi 4☆257Updated 5 months ago
- Compiler for the VC4CL OpenCL implementation☆118Updated 2 years ago
- BLAS library for VideoCore VI QPU (Raspberry Pi 4)☆65Updated 3 years ago
- An assembler/disassembler for the QPU processors on the Raspberry Pi☆120Updated 9 years ago
- ChainerPruner: Channel Pruning framework for Chainer☆21Updated 5 years ago
- A Python driver for VideoCore Shared Memory (VCSM) of Raspberry Pi☆25Updated 4 years ago
- experimental binary net implementation in chainer☆102Updated 9 years ago
- Chainer x TensorRT☆34Updated 6 years ago
- A memory manager for Raspberry Pi☆12Updated 3 weeks ago
- DNN Inference with CPU, C++, ONNX support: Instant☆56Updated 6 years ago
- OpenCL implementation running on the VideoCore IV GPU of the Raspberry Pi models☆737Updated 2 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Updated 6 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 5 years ago
- 「Raspberry Pi GPGPU入門」のリポジトリ☆23Updated 3 years ago
- Add-on package for ONNX format support in Chainer☆85Updated 5 years ago
- DeepDetect performance sheet☆93Updated 5 years ago
- GPU-side implementation of the OpenCL standard-library for VC4CL☆41Updated 3 years ago
- NNPACK for Darknet☆33Updated 8 years ago
- Demitasse: SPMD Programing Implementation of Deep Neural Network Library for Mobile Devices(NeurIPS2016WS)☆23Updated 8 years ago
- Language and compiler for the Raspberry Pi GPU☆435Updated 4 years ago
- chainer implementation of YOLO☆15Updated 7 years ago
- Experimental toolchain to compile and run Chainer models☆113Updated 5 years ago
- Caffe: a fast open framework for deep learning.☆43Updated 9 years ago
- Heterogeneous Run Time version of TensorFlow. Added heterogeneous capabilities to the TensorFlow, uses heterogeneous computing infrastruc…☆36Updated 7 years ago
- Intel® Optimization for Chainer*☆82Updated 2 years ago
- Fork of darknet-nnpack☆95Updated 6 years ago
- ChainerMN: Scalable distributed deep learning with Chainer☆206Updated 6 years ago
- ☆20Updated 8 years ago
- Math Kernel Library for VideoCore IV QPU☆11Updated 7 years ago