flozz / pypapi
Python binding for the PAPI (Performance Application Programming Interface) library
☆41Updated this week
Related projects: ⓘ
- Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)☆43Updated last year
- Tools and extensions for CUDA profiling☆63Updated 4 years ago
- Worked example of the process from Python source to CUDA kernel execution with Numba☆36Updated last week
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆80Updated 2 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆14Updated 3 years ago
- Kernel Tuning Toolkit☆54Updated 3 weeks ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Python bindings for UCX☆120Updated this week
- Automatically insert nvtx ranges to PyTorch models☆17Updated 3 years ago
- Python bindings for NVTX☆66Updated last year
- An ONNX backend using PlaidML☆28Updated 6 years ago
- nGraph™ Backend for ONNX☆42Updated last year
- An Aspiring Drop-In Replacement for Pandas at Scale☆73Updated 2 years ago
- ONNX-backed array library that is compliant with the Array API standard.☆29Updated last week
- A tracing JIT compiler for PyTorch☆12Updated 2 years ago
- FlipIt: An LLVM Based Fault Injector for HPC☆16Updated 3 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 4 years ago
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- Notes and artifacts from the ONNX steering committee☆24Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆73Updated 3 months ago
- A collection of scientific kernels using the numpy module for benchmarking purpose☆38Updated 3 years ago
- Bandwidth test for ROCm☆45Updated this week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆23Updated last month
- MIOpenGEMM is now deprecated☆61Updated last year
- Productionize machine learning predictions, with ONNX or without☆66Updated 8 months ago
- A task benchmark☆39Updated last month
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- Benchmarks to capture important workloads.☆28Updated 3 months ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated last year
- Includes Python bindings to instrumentation and tracing technology (ITT) APIs for VTune☆25Updated 8 months ago