Idein / qmkl
Math Kernel Library for VideoCore IV QPU
☆68Updated 6 years ago
Alternatives and similar repositories for qmkl:
Users that are interested in qmkl are comparing it to the libraries listed below
- Compiler for the VC4CL OpenCL implementation☆118Updated last year
- Python library for GPGPU programming on Raspberry Pi 4☆252Updated 2 months ago
- A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function☆88Updated 10 years ago
- BLAS library for VideoCore VI QPU (Raspberry Pi 4)☆66Updated 2 years ago
- An assembler/disassembler for the QPU processors on the Raspberry Pi☆121Updated 9 years ago
- GPU-side implementation of the OpenCL standard-library for VC4CL☆42Updated 3 years ago
- Chainer x TensorRT☆34Updated 6 years ago
- Raspberry Pi Projects☆84Updated 8 years ago
- A memory manager for Raspberry Pi☆11Updated 2 years ago
- Caffe: a fast open framework for deep learning.☆43Updated 8 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 4 years ago
- A Python driver for VideoCore Shared Memory (VCSM) of Raspberry Pi☆25Updated 3 years ago
- DeepDetect performance sheet☆93Updated 5 years ago
- Python library for GPGPU on Raspberry Pi☆801Updated last year
- Fork of darknet-nnpack☆96Updated 6 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Updated 6 years ago
- 「Raspberry Pi GPGPU入門」のリポジトリ☆23Updated 3 years ago
- DNN Inference with CPU, C++, ONNX support: Instant☆56Updated 6 years ago
- Docker images that support different OpenCl Runtime☆34Updated 8 years ago
- Add-on package for ONNX format support in Chainer☆85Updated 5 years ago
- NNPACK for Darknet☆33Updated 7 years ago
- Demitasse: SPMD Programing Implementation of Deep Neural Network Library for Mobile Devices(NeurIPS2016WS)☆23Updated 8 years ago
- Scripts to install TensorFlow on the NVIDIA Jetson TX1 Development Kit☆62Updated 7 years ago
- experimental binary net implementation in chainer☆101Updated 9 years ago
- MobileNet-SSD(MobileNetSSD) + Neural Compute Stick(NCS) Faster than YoloV2 + Explosion speed by RaspberryPi · Multiple moving object dete…☆92Updated 6 years ago
- This fork of the deep learning guide has been adapted to work with a variety of different inputs USB camera, GigEVision and RTP on the T…☆36Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- chainer implementation of YOLO☆15Updated 7 years ago
- This repository is test code for comparison of several deep learning frameworks.☆77Updated 6 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago