☆20Jul 22, 2022Updated 3 years ago
Alternatives and similar repositories for FastCNN
Users that are interested in FastCNN are comparing it to the libraries listed below
Sorting:
- ☆28Jun 30, 2025Updated 8 months ago
- Yet another Polyhedra Compiler for DeepLearning☆19Apr 14, 2023Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆20Sep 28, 2024Updated last year
- This is a demo how to write a high performance convolution run on apple silicon☆57Feb 8, 2022Updated 4 years ago
- Tile-based language built for AI computation across all scales☆138Updated this week
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Nov 7, 2019Updated 6 years ago
- DDK for Rockchip NPU☆69Dec 29, 2020Updated 5 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 4 years ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits☆35Aug 25, 2024Updated last year
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆31Aug 12, 2022Updated 3 years ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appli…☆12Jun 8, 2019Updated 6 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- 🌈 Solutions of LeetGPU☆72Feb 4, 2026Updated 3 weeks ago
- xkDLA:XinKai Deep Learning Accelerator (RTL)☆39Jan 15, 2024Updated 2 years ago
- Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.☆11Feb 19, 2023Updated 3 years ago
- ☆13Jul 16, 2024Updated last year
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- ☆13Sep 5, 2024Updated last year
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- arm-neon☆92Aug 2, 2024Updated last year
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago
- Proof of Concept to learn Amaranth as an entry effort for Supercon's RTL design competition☆10Nov 11, 2022Updated 3 years ago
- Write a cross_entropy function in pytorch to remove the abnormal nan value☆10Aug 22, 2019Updated 6 years ago
- A stream to RTL compiler based on MLIR and CIRCT☆16Nov 15, 2022Updated 3 years ago
- PolyLib official git.☆11Jan 27, 2026Updated last month
- ☆20Updated this week
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 2 months ago
- CLI utilty to work out proper constants for vpternlogic instruction☆13Jan 22, 2023Updated 3 years ago
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 5 years ago
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated last month
- Python scripts for WIDER FACE Evaluation☆10May 25, 2019Updated 6 years ago
- ☆11Aug 16, 2019Updated 6 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆11Apr 3, 2023Updated 2 years ago
- A merged read deduplication tool capable to perform merged read deduplication on single end data.☆12Sep 4, 2024Updated last year