onnx / digestaiLinks
Digest AI is a powerful model analysis tool that extracts insights from your models.
☆34Updated 4 months ago
Alternatives and similar repositories for digestai
Users that are interested in digestai are comparing it to the libraries listed below
Sorting:
- AI Tensor Engine for ROCm☆292Updated this week
 - TORCH_LOGS parser for PT2☆62Updated last month
 - Ahead of Time (AOT) Triton Math Library☆80Updated this week
 - An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 2 months ago
 - Home for OctoML PyTorch Profiler☆114Updated 2 years ago
 - Efficient in-memory representation for ONNX, in Python☆30Updated this week
 - MLPerf™ logging library☆37Updated 2 weeks ago
 - MLIR-based partitioning system☆143Updated this week
 - TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆167Updated this week
 - ☆61Updated this week
 - No-code CLI designed for accelerating ONNX workflows☆215Updated 4 months ago
 - A fork of tvm/unity☆14Updated 2 years ago
 - OpenAI Triton backend for Intel® GPUs☆215Updated this week
 - ☆42Updated last month
 - Fast low-bit matmul kernels in Triton☆388Updated last week
 - Write a fast kernel and run it on Discord. See how you compare against the best!☆58Updated 3 weeks ago
 - Visualize ONNX models with model-explorer☆62Updated 3 weeks ago
 - A lightweight MLIR Python frontend with support for PyTorch☆29Updated last year
 - Dev repo for power measurement for the MLPerf™ benchmarks☆25Updated last month
 - ☆68Updated 2 years ago
 - Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆389Updated last week
 - A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆359Updated this week
 - ☆41Updated 10 months ago
 - Development repository for the Triton language and compiler☆136Updated this week
 - Model compression for ONNX☆98Updated 11 months ago
 - A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆543Updated this week
 - AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆93Updated last week
 - Unified compiler/runtime for interfacing with PyTorch Dynamo.☆102Updated 2 months ago
 - Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 3 months ago
 - Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 4 months ago