SDA: Low-Bit Stable Diffusion Acceleration on Edge FPGAs
☆19May 23, 2024Updated last year
Alternatives and similar repositories for SDA_code
Users who are interested in SDA_code are comparing it to the libraries listed below.
- [FCCM 2023] PASTA: Programming and Automation Support for Scalable Task-Parallel HLS Programs on Modern Multi-Die FPGAs☆14Jun 26, 2025Updated 10 months ago
- ☆13Apr 15, 2025Updated last year
- Quickly get started with Ubuntu from scratch and set up a deep learning environment (continuously updated)☆11Apr 18, 2023Updated 3 years ago
- ☆14Aug 1, 2024Updated last year
- ☆14May 23, 2024Updated last year
- ☆53Aug 28, 2024Updated last year
- ☆17Nov 20, 2022Updated 3 years ago
- RISC-V-based many-core neuromorphic architecture☆16Apr 13, 2026Updated 2 weeks ago
- Simulator for BitFusion☆102Aug 6, 2020Updated 5 years ago
- A Linux shell written in LEX/YACC☆11Apr 12, 2015Updated 11 years ago
- The Carnegie Mellon Robot Navigation Toolkit.☆12Sep 25, 2014Updated 11 years ago
- Mad Zombie Classic 4th☆13Oct 8, 2023Updated 2 years ago
- Unofficial PyTorch implementation of the paper "Conditional Channel Gated Networks for Task-Aware Continual Learning"☆20Jan 22, 2021Updated 5 years ago
- Torch2Chip (MLSys, 2024)☆56Apr 2, 2025Updated last year
- CNN SIMD-based accelerator using Vitis HLS☆11Jul 15, 2022Updated 3 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- A Fast DNN Accelerator Design Space Exploration Framework.☆46Aug 10, 2022Updated 3 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 3 years ago
- NATSA is the first near-data-processing accelerator for time series analysis based on the Matrix Profile (SCRIMP) algorithm. NATSA exploi…☆16Jun 14, 2023Updated 2 years ago
- The official implementation of the DAC 2024 paper GQA-LUT☆22Dec 20, 2024Updated last year
- Instance segmentation of center pivot irrigation system in Brazil using Landsat images and Convolutional Neural Network☆11May 27, 2024Updated last year
- Allo Accelerator Design and Programming Framework (PLDI'24)☆373Mar 13, 2026Updated last month
- ☆12Nov 2, 2021Updated 4 years ago
- High-level synthesis (HLS) implementation of Sparse Matrix Vector Multiplication☆19Feb 17, 2022Updated 4 years ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆105May 22, 2023Updated 2 years ago
- Implementation of convolution layer in different flavors☆68Oct 8, 2017Updated 8 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 5 years ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators☆191Updated this week
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- FPGA implementation of an 8x8 weight-stationary systolic array DNN accelerator☆18Feb 27, 2021Updated 5 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- ☆26Nov 4, 2022Updated 3 years ago
- Training wide residual networks for deployment using a single bit for each weight - Official Code Repository for ICLR 2018 Published Pape…☆36May 27, 2020Updated 5 years ago
- Implementation of the Winograd algorithm.☆24Nov 6, 2018Updated 7 years ago
- ☆32Oct 2, 2023Updated 2 years ago
- ☆10Jun 28, 2019Updated 6 years ago
- ☆13Jul 2, 2016Updated 9 years ago
- Floating point morton order comparison operator.☆17May 1, 2024Updated last year
- Alveo Versal Example Design☆65Jan 28, 2026Updated 3 months ago