A profiler to disclose and quantify hardware features on GPUs.
☆176May 15, 2022Updated 3 years ago
Alternatives and similar repositories for ArchProbe
Users that are interested in ArchProbe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- LLM inference in C/C++☆21Oct 22, 2025Updated 6 months ago
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆264Apr 9, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Get Windows System Root certificates☆16Jan 21, 2026Updated 3 months ago
- ☆11Sep 4, 2025Updated 7 months ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- ☆257Sep 15, 2023Updated 2 years ago
- Tensor Tiling Library☆40Sep 23, 2025Updated 7 months ago
- A utility library for application developers to sample Arm Immortalis GPU or Arm Mali GPU performance counters.☆271Apr 14, 2026Updated 2 weeks ago
- row-major matmul optimization☆721Feb 24, 2026Updated 2 months ago
- A tool which profiles Vulkan devices to find their peak capacities☆169Apr 14, 2026Updated 2 weeks ago
- Derivation and numerical validation for the paper "Microsurface Transformations" (EGSR 2022) by Asen Atanasov, Vladimir Koylazov, Rossen …☆23Jul 11, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Apple G13 GPU architecture docs and tools☆654May 16, 2025Updated 11 months ago
- MLPerf™ Mobile models☆26Nov 16, 2025Updated 5 months ago
- Dynamic suballocators for external memory (e.g., Vulkan device memory). Umaintained - consider migrating to https://crates.io/crates/offs…☆15Jul 22, 2022Updated 3 years ago
- A tool which profiles OpenCL devices to find their peak capacities☆488Updated this week
- ☆25Feb 20, 2024Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆363Jul 30, 2024Updated last year
- Parsers for CUDA binary files☆24Dec 29, 2023Updated 2 years ago
- FidelityFX Parallel Sort☆120Oct 8, 2021Updated 4 years ago
- modified cutlass☆16Oct 26, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,000Sep 19, 2024Updated last year
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- FLOꟼ - An MIT-licensed image viewer equipped with a GPU-accelerated perceptual image diffing algorithm based on ꟻLIP☆68Jun 12, 2022Updated 3 years ago
- ☆12Mar 1, 2024Updated 2 years ago
- Improved Blue Noise Generator☆42Nov 11, 2025Updated 5 months ago
- ☆36Apr 10, 2024Updated 2 years ago
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆74Updated this week
- MegEngine到其他框架的转换器☆71Apr 27, 2023Updated 3 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆137Apr 22, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Yinghan's Code Sample☆364Jul 25, 2022Updated 3 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆59Apr 10, 2023Updated 3 years ago
- Fundamental Sources for Water Wave Animation☆20Dec 8, 2022Updated 3 years ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- ☆98Nov 4, 2022Updated 3 years ago
- ncnn android benchmark app☆86Aug 10, 2021Updated 4 years ago
- BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.☆762Aug 6, 2025Updated 8 months ago