Model LLM inference on single-core dataflow accelerators
☆18 · Dec 16, 2025 · Updated 4 months ago
Alternatives and similar repositories for zigzag-llm
Users that are interested in zigzag-llm are comparing it to the libraries listed below.
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads. ☆64 · Jul 5, 2025 · Updated 9 months ago
- A heterogeneous accelerator-centric compute cluster ☆40 · Updated this week
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators ☆189 · Jan 23, 2026 · Updated 2 months ago
- Simulator for LLM inference on an abstract 3D AIMC-based accelerator ☆28 · Sep 18, 2025 · Updated 7 months ago
- Driving Snax with MLIR ☆21 · Updated this week
- A framework for fast exploration of the depth-first scheduling space for DNN accelerators ☆43 · Feb 8, 2023 · Updated 3 years ago
- Efficient Neural Network Deployment on Heterogenous TinyML Platforms ☆16 · Sep 25, 2023 · Updated 2 years ago
- Originally at https://github.com/Scrawk/CGALUnity - thought I'd preserve a copy in case someone needed it ☆11 · Dec 20, 2021 · Updated 4 years ago
- ☆21 · May 13, 2024 · Updated last year
- This is my hobby project with System Verilog to accelerate LeViT Network which contain CNN and Attention layer. ☆36 · Aug 13, 2024 · Updated last year
- PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications ☆44 · May 5, 2023 · Updated 2 years ago
- ☆11 · Jun 29, 2021 · Updated 4 years ago
- HW accelerator mapping optimization framework for in-memory computing ☆29 · Jun 3, 2025 · Updated 10 months ago
- Panda with Deep Reinforcement Learning Simulation Environment Webots ☆10 · Apr 29, 2021 · Updated 4 years ago
- Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures" ☆20 · Jan 18, 2026 · Updated 3 months ago
- Single RISC-V CPU attached on AMBA AHB with Instruction and Data memories. ☆13 · Oct 31, 2021 · Updated 4 years ago
- ⛰ A simple tourism app UI & With dummy data. ☆13 · Jan 4, 2020 · Updated 6 years ago
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference ☆98 · Apr 26, 2025 · Updated 11 months ago
- This repository contains the hardware implementation for Static BFP convolution on FPGA ☆10 · Oct 15, 2019 · Updated 6 years ago
- ☆14 · Jul 6, 2022 · Updated 3 years ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022. ☆84 · Nov 7, 2021 · Updated 4 years ago
- Cross-platform C++ Starter Project Template with Premake5, GoogleTest, spdlog, Standalone Asio, and CI integration. Ready to use out of t… ☆19 · Nov 3, 2024 · Updated last year
- This code is for our ICML 2020 paper "On the Number of Linear Regions of Convolutional Neural Networks." ☆13 · Aug 5, 2020 · Updated 5 years ago
- RISCV lock-step checker based on Spike ☆14 · Mar 6, 2026 · Updated last month
- ☆92 · Jan 4, 2026 · Updated 3 months ago
- ☆14 · Oct 8, 2024 · Updated last year
- ☆12 · Jan 19, 2022 · Updated 4 years ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning ☆128 · Aug 27, 2024 · Updated last year
- An out-of-order processor that supports multiple instruction sets. ☆22 · Aug 23, 2022 · Updated 3 years ago
- ☆17 · Mar 8, 2025 · Updated last year
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers ☆36 · Apr 18, 2025 · Updated last year
- Implementation of weight stationary systolic array which has a size of 4x4(scalable) to 256X256 ☆30 · Feb 21, 2024 · Updated 2 years ago
- A high-efficiency system-on-chip for floating-point compute workloads. ☆45 · Jan 13, 2025 · Updated last year
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow ☆14 · Oct 30, 2018 · Updated 7 years ago
- ☆12 · Jun 22, 2023 · Updated 2 years ago
- ☆14 · Jun 4, 2024 · Updated last year
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design ☆129 · Jun 27, 2023 · Updated 2 years ago
- MICRO 2024 Evaluation Artifact for FuseMax ☆17 · Aug 26, 2024 · Updated last year
- Verilog Project ☆21 · Aug 30, 2021 · Updated 4 years ago