NPBench - A Benchmarking Suite for High-Performance NumPy
☆92Jan 28, 2026Updated last month
Alternatives and similar repositories for npbench
Users that are interested in npbench are comparing it to the libraries listed below
Sorting:
- DaCe - Data Centric Parallel Programming☆580Updated this week
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Mar 1, 2022Updated 4 years ago
- Data-Centric MLIR dialect☆46Oct 16, 2023Updated 2 years ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication☆15Mar 25, 2024Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- BLAS implementation for Intel FPGA☆78Nov 18, 2020Updated 5 years ago
- Standalone mini-app of the ECMWF cloud microphysics parameterization☆11Feb 24, 2026Updated 3 weeks ago
- ☆17Dec 8, 2023Updated 2 years ago
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Feb 16, 2023Updated 3 years ago
- MLIR tools and dialect for GraphBLAS☆18Mar 30, 2022Updated 3 years ago
- GPU Performance Advisor☆66Jul 25, 2022Updated 3 years ago
- The Global Environmental Multiscale (GEM) model is a numerical weather prediction model developed by the Meteorological Research Division…☆25Mar 11, 2026Updated last week
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- ☆16Oct 25, 2022Updated 3 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆50Feb 25, 2025Updated last year
- A Data-Centric Compiler for Machine Learning☆85Dec 14, 2025Updated 3 months ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- ☆61Aug 4, 2023Updated 2 years ago
- A translation validation framework for MLIR☆96Mar 19, 2025Updated last year
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Dec 1, 2023Updated 2 years ago
- An open-source framework for benchmarking of feature selection algorithms and cost functions.☆10Mar 6, 2020Updated 6 years ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆19May 28, 2024Updated last year
- Microbenchmarks showing relative performance of different Python functions/patterns.☆13Oct 3, 2025Updated 5 months ago
- RestFrames: particle physics event analysis library☆10Dec 17, 2020Updated 5 years ago
- Cloud Hackathon for Arm-based HPC with AWS and Arm☆31May 20, 2022Updated 3 years ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- [FPGA 2023] FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs☆25Feb 14, 2023Updated 3 years ago
- pandoc-like tool for symbolic regression expressions☆14Mar 10, 2024Updated 2 years ago
- Standard interface for collecting HPC run metadata☆16Nov 7, 2025Updated 4 months ago
- A low-level intermediate representation for hardware description languages☆28Jun 28, 2020Updated 5 years ago
- ACM TODAES Best Paper Award, 2022☆34Oct 24, 2023Updated 2 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Jul 17, 2019Updated 6 years ago
- ☆11Aug 8, 2021Updated 4 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆31Apr 2, 2025Updated 11 months ago
- Apollo: Online Machine Learning for Performance Portability☆26Aug 27, 2024Updated last year
- How to call NVTX from Fortran☆12Jun 25, 2025Updated 8 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆213Mar 3, 2026Updated 2 weeks ago
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 6 years ago
- C++17 implementation of an AST for Verilog code generation☆24Jun 14, 2023Updated 2 years ago