clementfarabet / neuflowLinks
Compiler toolkit for neuFlow.
☆26Updated 12 years ago
Alternatives and similar repositories for neuflow
Users that are interested in neuflow are comparing it to the libraries listed below
Sorting:
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆87Updated 9 months ago
- ☆119Updated 7 years ago
- training ternary neural networks☆15Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆136Updated 8 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆31Updated 8 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆101Updated 7 years ago
- ☆143Updated 6 years ago
- ☆35Updated 8 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆394Updated 2 years ago
- ☆39Updated 8 years ago
- A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function☆88Updated 11 years ago
- Binary Neural Network on IceStick FPGA.☆52Updated 7 years ago
- Rigel is a language for describing image processing hardware embedded in Lua. Rigel can compile to Verilog hardware designs for Xilinx FP…☆56Updated 4 years ago
- An assembler/disassembler for the QPU processors on the Raspberry Pi☆120Updated 9 years ago
- Training deep neural networks with low precision multiplications☆63Updated 10 years ago
- HLS branch of Halide☆77Updated 7 years ago
- Proof-of-Concept CNN in Halide☆22Updated 9 years ago
- OpenCL Demos for Xilinx FPGAs☆31Updated 9 years ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago
- ☆62Updated 7 years ago
- Darkroom Core☆48Updated 7 years ago
- Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015☆19Updated 9 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 8 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Updated 3 years ago
- Benchmarking matrix multiplication implementations☆100Updated 8 years ago
- Dynamically Allocated Neural Network Accelerator for the RISC-V Rocket Microprocessor in Chisel☆213Updated 5 years ago
- implementing a Recurrent Neural Network with binarized weight format on FPGA☆22Updated 7 years ago
- ☆22Updated 8 years ago