An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3
☆29May 30, 2021Updated 4 years ago
Alternatives and similar repositories for HPL-AI
Users that are interested in HPL-AI are comparing it to the libraries listed below
Sorting:
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Aug 20, 2025Updated 6 months ago
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- ☆14Nov 2, 2018Updated 7 years ago
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆20Jul 30, 2025Updated 7 months ago
- ☆24Mar 21, 2024Updated last year
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated last month
- ☆10May 12, 2022Updated 3 years ago
- ☆11Mar 27, 2024Updated last year
- ☆12Oct 25, 2022Updated 3 years ago
- ☆14Dec 5, 2024Updated last year
- Slides and exercises for persistent memory programming tutorial☆14Nov 14, 2022Updated 3 years ago
- The Zaychik Power Controller server☆13Apr 13, 2024Updated last year
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆92Oct 22, 2015Updated 10 years ago
- ☆15Dec 26, 2022Updated 3 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- ☆40Apr 3, 2022Updated 3 years ago
- outline and links for PLDI 2022 tutorial☆17Jun 13, 2022Updated 3 years ago
- Follow nginx log, and find out bad guys!☆23Feb 2, 2026Updated last month
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆17Feb 22, 2025Updated last year
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 2 months ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆84Mar 20, 2023Updated 2 years ago
- ngAP's artifact for ASPLOS'24☆25Jul 29, 2025Updated 7 months ago
- Overcoming the IOTLB Wall for Multi-100-Gbps Linux-based Networking☆24May 16, 2023Updated 2 years ago
- simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS☆29May 13, 2013Updated 12 years ago
- 使用 Github Actions 自动完成每周青年大学习☆17Dec 11, 2021Updated 4 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆115Jul 15, 2025Updated 7 months ago
- Header-only C++20 wrapper for MPI 4.0.☆47Jan 29, 2026Updated last month
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated last month
- ☆37Updated this week
- ☆31Jun 15, 2022Updated 3 years ago
- ☆11Jan 21, 2021Updated 5 years ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆26Feb 4, 2026Updated 3 weeks ago
- Memory Topology for GPUs☆17Feb 13, 2026Updated 2 weeks ago
- Port of the LLVM compiler infrastructure to the time-predictable processor Patmos☆15Apr 2, 2025Updated 11 months ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- HPCG benchmark based on ROCm platform☆39Updated this week
- CQU Dual Issue Machine☆38Jun 23, 2024Updated last year
- A mini, simple and modular compiler for SYsU/SysY(tiny C). Based on Clang/LLVM/ANTLR4/Bison/Flex.☆219Nov 27, 2024Updated last year
- ☆52Updated this week