mlcommons / mobile_models
MLPerf™ Mobile models
☆24Updated last month
Related projects: ⓘ
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆44Updated 2 weeks ago
- benchmarking some transformer deployments☆26Updated last year
- CUDA 12.2 HMM demos☆16Updated last month
- CUDA Template Functions☆18Updated last month
- ☆11Updated last year
- A tracing JIT compiler for PyTorch☆12Updated 2 years ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated last year
- ☆20Updated 11 months ago
- A GPU performance profiling tool for PyTorch models☆22Updated 2 years ago
- Benchmarks to capture important workloads.☆28Updated 3 months ago
- ☆18Updated last year
- ☆17Updated this week
- cuASR: CUDA Algebra for Semirings☆30Updated 2 years ago
- A tracing JIT for PyTorch☆18Updated 2 years ago
- ☆11Updated 3 years ago
- ☆50Updated this week
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- ☆18Updated this week
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆81Updated 7 months ago
- CUDA accelerated medical imaging algorithms☆14Updated 2 years ago
- ☆16Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆20Updated 3 years ago
- Python Interface to HIP and hiprtc Library☆9Updated 10 months ago
- TORCH_LOGS parser for PT2☆19Updated 2 weeks ago
- ☆66Updated last year
- a compiler for re-writing image processing functions in C++ to Halide☆22Updated last year
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆37Updated this week
- An extension library of WMMA API (Tensor Core API)☆81Updated 2 months ago
- ONNX Command-Line Toolbox☆35Updated last year
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.☆17Updated 5 years ago