Simple MLP Neural Network example using OpenCL kernels that can run on the CPU or GPU, supports Elman and Jordan recurrent networks
☆11Feb 21, 2017Updated 9 years ago
Alternatives and similar repositories for OpenCL-NeuralNetwork
Users that are interested in OpenCL-NeuralNetwork are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- OpenRISC Conference Website☆15Aug 15, 2024Updated last year
- Perceptron-based branch predictor written in C++☆14Dec 14, 2016Updated 9 years ago
- ☆14Feb 14, 2022Updated 4 years ago
- 基于insightface训练mobilefacenet的调试步骤,更改模型后一层训练结果为99.683% in lfw and 96.717 in agedb. Now pls move to the new mobilefacenet-V2…☆11Aug 28, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 7 years ago
- The implementation for "Salient Positions based Attention Network for Image Classification"☆17Jun 10, 2021Updated 4 years ago
- Repository for content for the AMLD2020 workshop "Spiking neural networks for real-time inference tasks"☆17Oct 28, 2020Updated 5 years ago
- Visualize TVM Relay program graph☆12Nov 19, 2019Updated 6 years ago
- Speech Recognition using DeepSpeech2.☆17Nov 19, 2019Updated 6 years ago
- Code for LIT, ICML 2019☆22Jun 11, 2019Updated 6 years ago
- The Stream-51 dataset for streaming classification and novelty detection from videos.☆17Feb 22, 2022Updated 4 years ago
- FPGA accelerator on GNU Radio and Zynq SoC☆16Feb 23, 2017Updated 9 years ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Aug 26, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the pubblication "Distilled Replay: Overcoming Forgetting through Synthetic Examples"☆12Apr 1, 2021Updated 5 years ago
- [ICLR 2024 Spotlight] 🚀 The official repository of Self-Supervised Learning method "ROPIM", "Pre-training with Random Orthogonal Project…☆10Jan 15, 2025Updated last year
- ☆19Dec 10, 2021Updated 4 years ago
- TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xi…☆34Apr 7, 2026Updated 2 months ago
- ☆11Aug 4, 2020Updated 5 years ago
- ☆15Feb 27, 2024Updated 2 years ago
- A branch predictor simulator in C++ that tests 6 different types of branch predictors.☆13Apr 26, 2018Updated 8 years ago
- c++ version of ViT☆12Nov 13, 2022Updated 3 years ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- Code for paper "Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking"☆18May 7, 2019Updated 7 years ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆16Apr 26, 2025Updated last year
- Chinese Guide for Alveo Getting Started☆12May 18, 2020Updated 6 years ago
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated 2 years ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆37Nov 13, 2025Updated 6 months ago
- Computer vision framework based on deep learning and GPU programming☆17Jun 16, 2019Updated 6 years ago
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fever prediction model using high-frequency real-time sensor data☆14Sep 15, 2020Updated 5 years ago
- Official implementation of the paper "Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay" (AAAI-2…☆18May 5, 2022Updated 4 years ago
- Python Implementation of Mini DFS☆15Jun 24, 2018Updated 7 years ago
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…☆16Mar 3, 2023Updated 3 years ago
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- Real-time panorama and image stitching using c++ and openCV CUDA☆12Sep 8, 2021Updated 4 years ago
- Code for reproducing "AC/DC: Alternating Compressed/DeCompressed Training of Deep Neural Networks" (NeurIPS 2021)☆23Nov 9, 2021Updated 4 years ago