mlcommons / mobile_models
MLPerf™ Mobile models
☆24Updated 3 months ago
Alternatives and similar repositories for mobile_models:
Users who are interested in mobile_models are comparing it to the libraries listed below
- CUDA accelerated medical imaging algorithms☆13Updated 2 years ago
- A tracing JIT compiler for PyTorch☆12Updated 3 years ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆14Updated 8 months ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 3 months ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆54Updated 4 months ago
- benchmarking some transformer deployments☆26Updated last year
- ONNX Parser is a tool that automatically generates OpenVX inference code (CNN) from ONNX binary model files.☆17Updated 6 years ago
- An IR for efficiently simulating distributed ML computation.☆25Updated last year
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆57Updated this week
- Benchmarks to capture important workloads.☆29Updated this week
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆16Updated 4 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆58Updated last week
- An 8-/16-/32-/64-bit floating point number family☆16Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆47Updated last year
- CUDA Template Functions☆19Updated last month
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 10 months ago
- ONNX Command-Line Toolbox☆35Updated 3 months ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Random number library that generates pseudo-random and quasi-random numbers.☆25Updated this week
- Yet another Polyhedra Compiler for DeepLearning☆19Updated last year
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- Sandbox for TVM and playing around!☆22Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆23Updated 3 years ago
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago