A small OpenCL benchmark program to measure peak GPU/CPU performance.
☆295May 10, 2026Updated last week
Alternatives and similar repositories for OpenCL-Benchmark
Users that are interested in OpenCL-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆57Mar 20, 2025Updated last year
- OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents …☆474May 10, 2026Updated last week
- A synthetic micro-benchmark that measures the peak achievable performance of GPU compute devices☆488Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆457Feb 7, 2026Updated 3 months ago
- An Open-Source SCAlable Interface for ISA Extensionsfor RISC-V Processors. New Version:☆17Feb 29, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A tool which profiles Vulkan devices to find their peak capacities☆169Apr 14, 2026Updated last month
- ☆14Nov 3, 2025Updated 6 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- Implementation of OpenCL 3.0 on Vulkan☆431May 6, 2026Updated 2 weeks ago
- Derived from Nemes' gpuperftests☆34Jul 11, 2024Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆68Updated this week
- Tuned OpenCL BLAS☆1,174Apr 13, 2026Updated last month
- ☆32Jul 2, 2025Updated 10 months ago
- Microbenchmarks showing relative performance of different Python functions/patterns.☆13Oct 3, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Print all known information about all available OpenCL platforms and devices in the system☆374Dec 19, 2025Updated 5 months ago
- AOSC Scriptlets, whatever that's too small to open a repository for☆12Mar 31, 2026Updated last month
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 3 years ago
- Dissecting NVIDIA GPU Architecture☆121Jul 11, 2022Updated 3 years ago
- Experimental OpenCL SPIR-V to OpenCL C translator☆29Mar 1, 2026Updated 2 months ago
- AOCL-Utils library to get CPU architecture, Cache information and CPU features flags etc.☆17Mar 24, 2026Updated last month
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆359May 5, 2026Updated 2 weeks ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A fortran library of fast functions☆28Jul 16, 2025Updated 10 months ago
- ☆167Updated this week
- CUDA GPU Benchmark☆38Jan 31, 2025Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆166May 4, 2026Updated 2 weeks ago
- ☆12Aug 31, 2023Updated 2 years ago
- 小彭老师推出 SyCL 2020 课程(施工中,日后会在直播中放出)☆15Sep 3, 2023Updated 2 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- Tristan-MP v2 [public]☆20Dec 29, 2024Updated last year
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- Benchmarks☆19May 14, 2026Updated last week
- ☆10Jul 18, 2024Updated last year
- DXT Explorer is an interactive web-based log analysis tool for Darshan DXT logs.☆17Feb 19, 2026Updated 3 months ago
- Fortran commandline-interface using a simple prototype command☆26Mar 31, 2026Updated last month
- A minimal in MLIR dialect along the lines of STG to represent laziness.☆17Jan 7, 2022Updated 4 years ago
- AutoParBench is a benchmark framework to evaluate compilers and tools designed to automatically insert OpenMP directives.☆12Nov 6, 2020Updated 5 years ago