RC4ML/Shuhai

Shuhai is a memory benchmarking tool that lets FPGA programmers demystify the underlying details of memories, e.g., HBM and DDR4, on a Xilinx FPGA [FCCM 20]

☆117

Alternatives and similar repositories for Shuhai

Users that are interested in Shuhai are comparing it to the libraries listed below.

RC4ML / FpgaNIC
FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]
☆143 · Aug 17, 2023 · Updated 2 years ago
CGCL-codes / ScalaBFS
A Scalable BFS Accelerator on FPGA-HBM Platform
☆15 · Feb 22, 2024 · Updated 2 years ago
os-hxfan / Static_BFP_HW
This repository contains the hardware implementation for Static BFP convolution on FPGA
☆10 · Oct 15, 2019 · Updated 6 years ago
UCLA-VAST / hbmbench
☆24 · Dec 1, 2020 · Updated 5 years ago
RC4ML / Hyperion
Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]
☆22 · Jan 6, 2025 · Updated last year
Xtra-Computing / ThunderGP
HLS-based Graph Processing Framework on FPGAs
☆152 · Oct 11, 2022 · Updated 3 years ago
MEEPproject / fpga_shell
MEEP FPGA Shell project, currently supporting Alveos u280 and u55c
☆16 · Mar 14, 2024 · Updated 2 years ago
jiegec / chisel-memory-lower
Lower chisel memories to SRAM macros
☆13 · Mar 25, 2024 · Updated 2 years ago
UCLA-VAST / tapa
TAPA compiles task-parallel HLS program into high-performance FPGA accelerators. UCLA-maintained. Community-maintained version with binar…
☆189 · Mar 8, 2026 · Updated 2 months ago
Xilinx / Vitis_Accel_Examples
Vitis_Accel_Examples
☆594 · Mar 30, 2026 · Updated last month
Xtra-Computing / ReGraph
Scaling Graph Processing on HBM-enabled FPGAs with Heterogeneous Pipelines
☆22 · Aug 8, 2022 · Updated 3 years ago
abs-tudelft / vhsnunzip
Hardware Snappy decompressor
☆12 · Sep 11, 2024 · Updated last year
BrianHGinc / Verilog-Floating-Point-Clock-Divider
Provide / define the INPUT_CLK_HZ parameter and the BHG_FP_clk_divider.v will generate a clock at the specified CLK_OUT_HZ parameter usin…
☆22 · Feb 4, 2025 · Updated last year
UCLA-VAST / AutoBridge
[FPGA 2021, Best Paper Award] An automated floorplanning and pipelining tool for Vivado HLS.
☆127 · Jan 3, 2023 · Updated 3 years ago
Xilinx / xup_vitis_network_example
VNx: Vitis Network Examples
☆159 · Aug 25, 2025 · Updated 8 months ago
maltanar / fpga-booleanring-bfs
Hybrid BFS on Xilinx Zynq
☆18 · Jun 9, 2015 · Updated 10 years ago
ChengZhang-98 / llm-mixed-q
Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"
☆24 · Oct 25, 2023 · Updated 2 years ago
Xilinx / Get_Moving_With_Alveo
For publishing the source for UG1352 "Get Moving with Alveo"
☆51 · Jun 17, 2020 · Updated 5 years ago
RC4ML / Legion
GPU-initiated Large-scale GNN System [ATC 23]
☆19 · Oct 30, 2024 · Updated last year
Liu-Cheng / graph_accelerator
Graph accelerator on FPGAs and ASICs
☆11 · Aug 16, 2018 · Updated 7 years ago
ic-lab-duth / Fast-Float4HLS
Fast Floating Point Operators for High Level Synthesis
☆25 · Feb 23, 2023 · Updated 3 years ago
m-asiatici / MSHR-rich
A multi-banked non-blocking cache that handles efficiently thousands of outstanding misses, especially suited for bandwidth-bound latency…
☆21 · Dec 3, 2020 · Updated 5 years ago
Xilinx / AlveoLink
This repository contains IPs, Vitis kernels and software APIs that can be leveraged by Vitis users to build scale-out solutions on multip…
☆25 · Apr 27, 2023 · Updated 3 years ago
doctor3w / HLS-Cryptography-Accelerator
A crypto accelerator written for HLS to an FPGA that actually makes it slower than running it on your computer
☆18 · Dec 11, 2018 · Updated 7 years ago
RC4ML / LoHan
A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server w…
☆23 · Mar 21, 2025 · Updated last year
TUD-ADS / HiFlipVX
☆20 · Aug 9, 2022 · Updated 3 years ago
KULeuven-COSIC / fpt-demo
FPT: a Fixed-Point Accelerator for Torus Fully Homomorphic Encryption
☆29 · Sep 2, 2025 · Updated 8 months ago
Xilinx / ACCL
Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
☆103 · Jun 30, 2025 · Updated 10 months ago
CMU-SAFARI / ramulator
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, …
☆693 · Aug 29, 2023 · Updated 2 years ago
fpgasystems / hacc
ETHZ Heterogeneous Accelerated Compute Cluster.
☆41 · Oct 7, 2025 · Updated 7 months ago
SFU-HiAccel / uBench
[FPGA'21] Microbenchmarks for Demystifying the Memory System of Modern Datacenter FPGAs for Software Programmers
☆31 · Dec 16, 2021 · Updated 4 years ago
fpgasystems / Vitis_with_100Gbps_TCP-IP
100 Gbps TCP/IP stack for Vitis shells
☆232 · Apr 23, 2024 · Updated 2 years ago
Xilinx / open-nic-shell
AMD OpenNIC Shell includes the HDL source files
☆140 · Jan 2, 2025 · Updated last year
dubcyfor3 / Focus
[HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …
☆51 · Feb 8, 2026 · Updated 3 months ago
fpgasystems / Coyote
Framework providing operating system abstractions and a range of shared networking and memory services for common modern heterogeneous pl…
☆356 · May 11, 2026 · Updated last week
AlessandroCilardo / NaplesPU
The official NaplesPU hardware code repository
☆24 · Jul 27, 2019 · Updated 6 years ago
cornell-zhang / HiSparse
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS
☆100 · Sep 27, 2024 · Updated last year
vickyiii / Quick-Start-Guide-for-HLS
This is a series of quick start guide of Vitis HLS tool in Chinese. It explains the basic concepts and the most important optimize techni…
☆25 · Nov 9, 2022 · Updated 3 years ago
maestro-project / AIrchitect-v2
[DATE 2025] Official implementation and dataset of AIrchitect v2: Learning the Hardware Accelerator Design Space through Unified Represen…
☆19 · Jan 17, 2025 · Updated last year