deater / performance_resultsLinks
performance results/benchmarks for a variety of machines
☆31Updated 7 months ago
Alternatives and similar repositories for performance_results
Users that are interested in performance_results are comparing it to the libraries listed below
Sorting:
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆91Updated last year
- EEMBC's Machine-Learning Inference Benchmark targeted at edge devices.☆50Updated 3 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆97Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last week
- GPTPU for SC 2021☆52Updated 2 years ago
- CUPTI GPU Profiler☆40Updated 6 years ago
- SST Architectural Simulation Components and Libraries☆102Updated last week
- Emulating DMA Engines on GPUs for Performance and Portability☆41Updated 10 years ago
- Python Cache Hierarchy Simulator☆99Updated 2 months ago
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/☆23Updated 6 years ago
- A tool for examining GPU scheduling behavior.☆88Updated last year
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…☆24Updated 10 months ago
- A Benchmark Suite for Heterogeneous System Computation☆54Updated 7 months ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆83Updated 5 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Updated last year
- Advanced Matrix Extensions (AMX) Guide☆102Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆152Updated last week
- First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.☆48Updated 7 years ago
- ☆60Updated 3 years ago
- ☆179Updated this week
- ☆13Updated 5 years ago
- The SHOC Benchmark Suite☆257Updated this week
- GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.☆116Updated 10 years ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆175Updated last week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆47Updated last week
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆77Updated last week
- oneAPI Collective Communications Library (oneCCL)☆245Updated 2 weeks ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆64Updated 3 weeks ago
- GPUDirect example☆60Updated 3 years ago