NPBench - A Benchmarking Suite for High-Performance NumPy
☆92Apr 15, 2026Updated 2 months ago
Alternatives and similar repositories for npbench
Users that are interested in npbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DaCe - Data Centric Parallel Programming☆590Updated this week
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Mar 1, 2022Updated 4 years ago
- Data-Centric MLIR dialect☆47Oct 16, 2023Updated 2 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Selected Decomposition Routines☆23Apr 20, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BLAS implementation for Intel FPGA☆78Nov 18, 2020Updated 5 years ago
- Standalone mini-app of the ECMWF cloud microphysics parameterization☆11Jun 12, 2026Updated 2 weeks ago
- Code examples for "Under the hood of calling C/C++ from Python"☆13Sep 16, 2020Updated 5 years ago
- ☆17Dec 8, 2023Updated 2 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆10Feb 2, 2022Updated 4 years ago
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Feb 16, 2023Updated 3 years ago
- MLIR tools and dialect for GraphBLAS☆18Mar 30, 2022Updated 4 years ago
- GPU Performance Advisor☆66Jul 25, 2022Updated 3 years ago
- Python interface for MLIR - the Multi-Level Intermediate Representation☆271Nov 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ext_mpi_collectives☆11Jun 3, 2026Updated 3 weeks ago
- ☆16Oct 25, 2022Updated 3 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆51Feb 25, 2025Updated last year
- A Data-Centric Compiler for Machine Learning☆85Dec 14, 2025Updated 6 months ago
- cuASR: CUDA Algebra for Semirings☆49Aug 22, 2022Updated 3 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆33Feb 21, 2026Updated 4 months ago
- ☆14Feb 14, 2022Updated 4 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆39Dec 1, 2023Updated 2 years ago
- An open-source framework for benchmarking of feature selection algorithms and cost functions.☆10Mar 6, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆19May 28, 2024Updated 2 years ago
- Microbenchmarks showing relative performance of different Python functions/patterns.☆13Oct 3, 2025Updated 8 months ago
- RestFrames: particle physics event analysis library☆11Dec 17, 2020Updated 5 years ago
- Cloud Hackathon for Arm-based HPC with AWS and Arm☆31May 20, 2022Updated 4 years ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- [FPGA 2023] FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs☆25Feb 14, 2023Updated 3 years ago
- Environment modules for NGC containers☆30Nov 19, 2021Updated 4 years ago
- A low-level intermediate representation for hardware description languages☆28Jun 28, 2020Updated 6 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Jul 17, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ACM TODAES Best Paper Award, 2022☆35Oct 24, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆33Apr 2, 2025Updated last year
- ☆11Aug 8, 2021Updated 4 years ago
- Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)☆50Jun 1, 2026Updated last month
- pandoc-like tool for symbolic regression expressions☆15Mar 10, 2024Updated 2 years ago
- How to call NVTX from Fortran☆13Jun 25, 2025Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆215Apr 18, 2026Updated 2 months ago