albertozeni / starlightView external linksLinks
Starlight: A Kernel Optimizer for GPU Processing
☆16Jan 10, 2024Updated 2 years ago
Alternatives and similar repositories for starlight
Users that are interested in starlight are comparing it to the libraries listed below
Sorting:
- Image Registration on FPGAs☆21Aug 28, 2022Updated 3 years ago
- Template Repository for Xilinx HLS design flow☆12Nov 18, 2021Updated 4 years ago
- Polyglot CUDA integration for the GraalVM☆18Apr 6, 2025Updated 10 months ago
- ☆12Apr 15, 2025Updated 9 months ago
- OpenMP front-end based on LLVM for CGRAs☆10Oct 2, 2022Updated 3 years ago
- A collection of Matplotlib and Seaborn recipes and utilities collected over years of colorful plot-making☆22Nov 17, 2023Updated 2 years ago
- ☆14Nov 30, 2023Updated 2 years ago
- A OpenCL-based FPGA benchmark suite for HPC☆37Jan 29, 2026Updated 2 weeks ago
- Tuning Assistant for Floating point to Fixed point Optimization☆19Mar 26, 2022Updated 3 years ago
- PYNQ bindings for C and C++ to avoid requiring Python or Vitis to execute hardware acceleration.☆28Dec 22, 2025Updated last month
- AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper a…☆24May 18, 2025Updated 8 months ago
- Educational verilog library that supports IEEE754 floating point arithmetic with a parametrizable mantissa and exponent☆32Mar 13, 2025Updated 11 months ago
- ☆38Jul 6, 2025Updated 7 months ago
- Optimize GEMM with tensorcore step by step☆36Dec 17, 2023Updated 2 years ago
- An awesome curated list of languages and tools to program FPGAs☆73Jun 22, 2022Updated 3 years ago
- [FPGA'21] Microbenchmarks for Demystifying the Memory System of Modern Datacenter FPGAs for Software Programmers☆31Dec 16, 2021Updated 4 years ago
- ☆40Mar 26, 2020Updated 5 years ago
- ETHZ Heterogeneous Accelerated Compute Cluster.☆38Oct 7, 2025Updated 4 months ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 5 months ago
- Number Geometry methods: Shortest Vector Problem and Shorter Basis Problem in Lattice (Hamming distance, Bounded distance decoding, bina…☆13May 19, 2023Updated 2 years ago
- Pipeline used internally for Peter Bubenik's TDA Group at UF.☆11Nov 3, 2022Updated 3 years ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- We implement the progressive Improved Progressive BKZ with Lattice Sieving presented in https://eprint.iacr.org/2022/1343, one can call i…☆13Feb 14, 2025Updated last year
- FPGA version of Rodinia in HLS C/C++☆40Dec 24, 2020Updated 5 years ago
- ☆10Apr 28, 2023Updated 2 years ago
- This repo contains instructions, benchmarks, and files for running user space networking in gem5 simulator.☆11Aug 1, 2024Updated last year
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆40Jul 24, 2024Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- PYNQ for Zybo board☆11Jan 30, 2026Updated 2 weeks ago
- ☆12Mar 1, 2024Updated last year
- Simulation and data processing of the Gaussian-modulated coherent-state protocol with homodyne detection☆14Jan 10, 2022Updated 4 years ago
- [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design☆22Jul 4, 2025Updated 7 months ago
- See if we can't do some real-time learning for GMRES -- Rejoice!☆12Jun 19, 2022Updated 3 years ago
- FPGA Additive White Gaussian Noise Generator Using the Box Mueller Method☆11Oct 7, 2016Updated 9 years ago
- Julia implementation of flash-attention operation for neural networks.☆11May 31, 2023Updated 2 years ago
- LaTeX Examples Document Source☆11Apr 9, 2024Updated last year
- An MPI wrapper for the pytorch tensor library that is automatically differentiable☆10Mar 27, 2023Updated 2 years ago
- The source codes of the proposed NB-LDPC decoder published in IEEE Communications Letters☆12Jan 8, 2018Updated 8 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 5 years ago