Library for fast image convolution in neural networks on Intel Architecture
☆30Jun 25, 2017Updated 8 years ago
Alternatives and similar repositories for FALCON
Users that are interested in FALCON are comparing it to the libraries listed below.
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Jul 7, 2017Updated 8 years ago
- Phase Fair and Standard Reader Writer Locks☆17Sep 16, 2019Updated 6 years ago
- Improved performance for TensorFlow on Intel hardware.☆13Jun 25, 2018Updated 7 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆627Feb 9, 2026Updated 2 months ago
- ☆10Aug 4, 2022Updated 3 years ago
- Portable 128-bit SIMD intrinsics☆59Jul 4, 2023Updated 2 years ago
- Create prototxt for variants of ResNet (including training and test)☆21May 28, 2018Updated 7 years ago
- A fast implementation of the ECMA-182 CRC64 checksum using the CLMUL instruction set☆15Nov 1, 2016Updated 9 years ago
- Torch FFI-bindings for NNPACK☆31May 26, 2017Updated 8 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆60Apr 14, 2024Updated 2 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- ☆21Jan 21, 2026Updated 3 months ago
- Segmented Code Adjustment Quantization (SAQ)☆21Sep 22, 2025Updated 7 months ago
- TensorFlow frozen forward model to plain C++ converter☆10Aug 7, 2018Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Apr 20, 2017Updated 9 years ago
- Library and accelerator backend☆15Apr 25, 2026Updated last week
- ☆18Apr 8, 2022Updated 4 years ago
- Winograd-based convolution implementation in OpenCL☆28Jan 22, 2017Updated 9 years ago
- Detect facial landmarks with mini-caffe☆18Feb 23, 2017Updated 9 years ago
- A minimalist Deep Learning framework for embedded Computer Vision☆47Dec 31, 2019Updated 6 years ago
- ☆14Feb 7, 2020Updated 6 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆48Feb 10, 2015Updated 11 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆949Mar 18, 2026Updated last month
- Runtime library for the NVIDIA RTXNTC SDK☆27Jan 22, 2026Updated 3 months ago
- ☆24Oct 17, 2016Updated 9 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,704Jun 11, 2024Updated last year
- Caffe implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆27Jul 18, 2017Updated 8 years ago
- Open single and half precision gemm implementations☆397Apr 2, 2023Updated 3 years ago
- A simple tool to profile performance of multiple combinations of GEMM of cuBLAS☆25Feb 9, 2021Updated 5 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- A neural network library for C++☆12Jul 2, 2025Updated 10 months ago
- Implementation for <Neural Similarity Learning> in NeurIPS'19.☆33Aug 23, 2020Updated 5 years ago
- An Efficient Transport Estimator for Complex Layered Materials☆13May 28, 2020Updated 5 years ago
- The SparseX sparse kernel optimization library☆43Jan 16, 2019Updated 7 years ago
- A Winograd based kernel for convolutions in deep learning framework☆15Jul 22, 2017Updated 8 years ago
- Slide materials from study sessions☆13Jun 12, 2016Updated 9 years ago
- pbft - Practical Byzantine Fault Tolerance☆13Jul 18, 2015Updated 10 years ago
- pycaffe version of RSA 'Recurrent Scale Approximation for Object Detection in CNN'☆32Dec 5, 2017Updated 8 years ago
- Parse TF-Lite Model File in C++☆13Aug 4, 2019Updated 6 years ago