RIKEN-RCCS/hpl-ai

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RIKEN-RCCS/hpl-ai)

RIKEN-RCCS / hpl-ai

An HPL-AI implementation for Fugaku

☆24

Alternatives and similar repositories for hpl-ai

Users that are interested in hpl-ai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wu-kan / HPL-AI
View on GitHub
An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3
☆30May 30, 2021Updated 5 years ago
davidrohr / caldgemm
View on GitHub
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆16Apr 5, 2018Updated 8 years ago
flecsi / flecsi
View on GitHub
Flexible Computational Science (FleCSI) Project
☆27Updated this week
GSI-HPC / slurm-singularity-exec
View on GitHub
The Singularity SPANK plugin provides the users with an interface to launch an application within a Linux container.
☆14Nov 4, 2025Updated 8 months ago
facebookresearch / torch_ucc
View on GitHub
Pytorch process group third-party plugin for UCC
☆22Apr 15, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
helmholtz-analytics / mpi4torch
View on GitHub
An MPI wrapper for the pytorch tensor library that is automatically differentiable
☆10Mar 27, 2023Updated 3 years ago
ARM-software / HPCG_for_Arm
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
argonne-lcf / alcl
View on GitHub
Argonne Leadership Computing Facility OpenCL tutorial
☆10Aug 22, 2025Updated 10 months ago
UoB-HPC / miniBUDE
View on GitHub
A BUDE virtual-screening benchmark, in many programming models
☆31Oct 15, 2024Updated last year
reger-men / HPL_GPU
View on GitHub
High-Performance Linpack Benchmark adopted version for GPU backend
☆12Sep 12, 2022Updated 3 years ago
regehr / pldi22-llvm-tutorial
View on GitHub
outline and links for PLDI 2022 tutorial
☆17Jun 13, 2022Updated 4 years ago
openucx / xpmem
View on GitHub
Linux Cross-Memory Attach
☆23Apr 21, 2026Updated 3 months ago
clusterinthecloud / docs
View on GitHub
A tutorial to set up a running compute cluster on cloud resources
☆11Jul 7, 2023Updated 3 years ago
UoB-HPC / minifmm
View on GitHub
☆11Aug 8, 2021Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
harrism / cuda_event_benchmark
View on GitHub
Unit benchmarks of CUDA event APIs.
☆17Apr 23, 2024Updated 2 years ago
md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
mrnorman / miniWeatherML
View on GitHub
Exploring Machine Learning methods and workflows in a simplified weather model
☆19Jun 6, 2024Updated 2 years ago
bsc-pm / nanos6
View on GitHub
Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…
☆22Jun 15, 2026Updated last month
quettabit / convolution_kernel
View on GitHub
Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.
☆14Dec 8, 2017Updated 8 years ago
ECP-copa / CabanaMD
View on GitHub
Molecular dynamics proxy application based on Cabana
☆21Feb 20, 2025Updated last year
IBM / HPCG
View on GitHub
Open source of an IBM Optimized version of the HPCG benchmark.
☆17Sep 17, 2025Updated 10 months ago
aditya4d / gemm-vega64
View on GitHub
Implement asm gemm on vega64 for 4096x4096 fp32 matrix
☆22Oct 12, 2019Updated 6 years ago
pyxis-roc / ptxparser
View on GitHub
A parser for PTX 6.5
☆13Jun 19, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
openucx / xccl
View on GitHub
☆26May 19, 2021Updated 5 years ago
cubicstyle / raspiadvrw
View on GitHub
GBA Cartridge reader writer for Raspberry Pi ADVANCE Expansion Board.
☆17Jun 8, 2022Updated 4 years ago
hpcgame / hpcgame-platform-0th
View on GitHub
HPC Game Platform
☆11Apr 20, 2023Updated 3 years ago
ecrc / hicma
View on GitHub
HiCMA: Hierarchical Computations on Manycore Architectures
☆37Mar 19, 2023Updated 3 years ago
ENCCS / sycl-workshop
View on GitHub
SYCL materials for ENCCS workshop
☆25Apr 25, 2023Updated 3 years ago
ariasanovsky / ptx-parser
View on GitHub
☆11Jun 9, 2023Updated 3 years ago
KernelTuner / kernel_launcher
View on GitHub
Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner
☆21Sep 12, 2025Updated 10 months ago
NaoyukiIchimura / cuda_image_filtering_global
View on GitHub
☆11Dec 5, 2018Updated 7 years ago
ROCm / AITemplate
View on GitHub
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆12Jun 24, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
arm-hpc / porting-advisor
View on GitHub
Scans for potential unported or non-portable code in source code trees.
☆24May 20, 2025Updated last year
ChandlerGuan / kperfir_artifact
View on GitHub
☆19May 9, 2025Updated last year
deathwings602 / Unified-IR
View on GitHub
面向多平台编译优化的深度学习中间表示
☆10Oct 28, 2024Updated last year
LostXine / naive-android-ssr
View on GitHub
Naive Android Screen Stream Reader, this project decodes screenrecord stream from an Android device to OpenCV. Written in Python
☆13May 27, 2023Updated 3 years ago
NGIOproject / PMTutorial
View on GitHub
Slides and exercises for persistent memory programming tutorial
☆14Nov 14, 2022Updated 3 years ago
vortexgpgpu / NVPTX-SPIRV-Translator
View on GitHub
The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.
☆45Oct 25, 2021Updated 4 years ago
pecos / tps
View on GitHub
Torch Plasma Simulator
☆11Jul 13, 2026Updated last week