Starlight: A Kernel Optimizer for GPU Processing
☆16Jan 10, 2024Updated 2 years ago
Alternatives and similar repositories for starlight
Users that are interested in starlight are comparing it to the libraries listed below
Sorting:
- Template Repository for Xilinx HLS design flow☆12Nov 18, 2021Updated 4 years ago
- LOGAN: High-Performance Multi-GPU X-Drop Long-Read Alignment.☆30Sep 23, 2022Updated 3 years ago
- Polyglot CUDA integration for the GraalVM☆18Apr 6, 2025Updated 11 months ago
- ☆14Nov 30, 2023Updated 2 years ago
- A OpenCL-based FPGA benchmark suite for HPC☆37Jan 29, 2026Updated last month
- A Scalable BFS Accelerator on FPGA-HBM Platform☆15Feb 22, 2024Updated 2 years ago
- Tuning Assistant for Floating point to Fixed point Optimization☆19Mar 26, 2022Updated 3 years ago
- PYNQ bindings for C and C++ to avoid requiring Python or Vitis to execute hardware acceleration.☆30Feb 23, 2026Updated last week
- AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper a…☆25May 18, 2025Updated 9 months ago
- Educational verilog library that supports IEEE754 floating point arithmetic with a parametrizable mantissa and exponent☆32Mar 13, 2025Updated 11 months ago
- ☆38Jul 6, 2025Updated 8 months ago
- An awesome curated list of languages and tools to program FPGAs☆73Jun 22, 2022Updated 3 years ago
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)☆59Feb 24, 2026Updated last week
- ☆38Mar 14, 2024Updated last year
- ETHZ Heterogeneous Accelerated Compute Cluster.☆38Oct 7, 2025Updated 4 months ago
- Pipeline used internally for Peter Bubenik's TDA Group at UF.☆11Nov 3, 2022Updated 3 years ago
- Learning Environment-aware and hardware-compatible beam-forming codebooks☆15Mar 8, 2020Updated 5 years ago
- Ariston Net integration with home assistant☆10Nov 3, 2020Updated 5 years ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- ☆12Apr 2, 2025Updated 11 months ago
- We implement the progressive Improved Progressive BKZ with Lattice Sieving presented in https://eprint.iacr.org/2022/1343, one can call i…☆13Feb 14, 2025Updated last year
- Number Geometry methods: Shortest Vector Problem and Shorter Basis Problem in Lattice (Hamming distance, Bounded distance decoding, bina…☆13May 19, 2023Updated 2 years ago
- This repo contains instructions, benchmarks, and files for running user space networking in gem5 simulator.☆12Aug 1, 2024Updated last year
- ☆10Apr 28, 2023Updated 2 years ago
- FPGA version of Rodinia in HLS C/C++☆40Dec 24, 2020Updated 5 years ago
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆40Jul 24, 2024Updated last year
- ManifoldNet Paper Implementation for SPD(n)☆11Nov 10, 2021Updated 4 years ago
- Microbenchmark that unveals the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.☆14Dec 12, 2024Updated last year
- Bagua tutorials.☆13Sep 4, 2022Updated 3 years ago
- matlab implementation of online dictionary learning with example driver code☆11Apr 16, 2016Updated 9 years ago
- LDPC decoders for ARM processor☆12Jul 23, 2021Updated 4 years ago
- LaTeX Examples Document Source☆11Apr 9, 2024Updated last year
- ☆57Jul 11, 2024Updated last year
- Website for Particle Physics Domain (UCSD Capstone)☆12Oct 23, 2021Updated 4 years ago
- High-throughput LDPC decoder on GPU device (see published IEEE article)☆12Jan 25, 2019Updated 7 years ago
- Chameleon: A MatMul-Free TCN Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data☆26Jun 6, 2025Updated 9 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- ☆10Mar 2, 2024Updated 2 years ago