A profiler to disclose and quantify hardware features on GPUs.
☆176May 15, 2022Updated 4 years ago
Alternatives and similar repositories for ArchProbe
Users that are interested in ArchProbe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 3 years ago
- LLM inference in C/C++☆21Oct 22, 2025Updated 8 months ago
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆194Aug 17, 2023Updated 2 years ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆264Apr 9, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Get Windows System Root certificates☆16Jan 21, 2026Updated 5 months ago
- ☆11Sep 4, 2025Updated 9 months ago
- ☆256Sep 15, 2023Updated 2 years ago
- A tool for patching the TensorFlow frozen protobuf file for compatibility to RKNN & SNPE SDK.☆11Feb 22, 2021Updated 5 years ago
- Tensor Tiling Library☆42Sep 23, 2025Updated 9 months ago
- A utility library for application developers to sample Arm Immortalis GPU or Arm Mali GPU performance counters.☆275May 8, 2026Updated last month
- row-major matmul optimization☆737May 14, 2026Updated last month
- A tool which profiles Vulkan devices to find their peak capacities☆171Jun 21, 2026Updated last week
- Derivation and numerical validation for the paper "Microsurface Transformations" (EGSR 2022) by Asen Atanasov, Vladimir Koylazov, Rossen …☆23Jul 11, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MLPerf™ Mobile models☆26Apr 30, 2026Updated last month
- Dynamic suballocators for external memory (e.g., Vulkan device memory). Umaintained - consider migrating to https://crates.io/crates/offs…☆15Jul 22, 2022Updated 3 years ago
- A synthetic micro-benchmark that measures peak compute, bandwidth, and matrix throughput of GPUs and CPUs☆500Jun 22, 2026Updated last week
- ☆26Feb 20, 2024Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆364Jul 30, 2024Updated last year
- Parsers for CUDA binary files☆25Dec 29, 2023Updated 2 years ago
- lossy GPU-friendly image compression\decompression☆13Dec 5, 2021Updated 4 years ago
- MFCStoreClient is an example of how to access Windows Store APIs from a C++ MFC app.☆20Sep 1, 2022Updated 3 years ago
- modified cutlass☆16Oct 26, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,002Sep 19, 2024Updated last year
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- FLOꟼ - An MIT-licensed image viewer equipped with a GPU-accelerated perceptual image diffing algorithm based on ꟻLIP☆68Jun 12, 2022Updated 4 years ago
- Light weight SPIR-V reflection library☆112Feb 23, 2026Updated 4 months ago
- ☆12Mar 1, 2024Updated 2 years ago
- Improved Blue Noise Generator☆42Nov 11, 2025Updated 7 months ago
- Converts RenderDoc Vulkan capture to compilable and executable C++ code.☆26Jan 6, 2020Updated 6 years ago
- ☆37Apr 10, 2024Updated 2 years ago
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆74Apr 30, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Apple GPU microarchitecture☆617Sep 22, 2024Updated last year
- MegEngine到其他框架的转换器☆71Apr 27, 2023Updated 3 years ago
- Yinghan's Code Sample☆365Jul 25, 2022Updated 3 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆59Apr 10, 2023Updated 3 years ago
- Fundamental Sources for Water Wave Animation☆20Dec 8, 2022Updated 3 years ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- ☆99Nov 4, 2022Updated 3 years ago