justinchuby / model-explorer-onnx
ONNX Adapter for model-explorer
☆25Updated 3 months ago
Alternatives and similar repositories for model-explorer-onnx:
Users who are interested in model-explorer-onnx are comparing it to the libraries listed below:
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆304Updated this week
- The no-code AI toolchain☆80Updated this week
- Notes and artifacts from the ONNX steering committee☆25Updated last week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆134Updated this week
- Fast low-bit matmul kernels in Triton☆187Updated last week
- Applied AI experiments and examples for PyTorch☆211Updated this week
- Common utilities for ONNX converters☆256Updated last month
- OpenAI Triton backend for Intel® GPUs☆154Updated this week
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆37Updated 8 months ago
- Model compression for ONNX☆80Updated 2 months ago
- High-speed GEMV kernels, with up to 2.7x speedup compared to the PyTorch baseline.☆93Updated 6 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆291Updated this week
- Extensible collectives library in Triton☆76Updated 3 months ago
- This repository contains the experimental PyTorch native float8 training UX☆219Updated 5 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆244Updated 9 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆238Updated this week
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime☆349Updated this week
- Home for OctoML PyTorch Profiler☆107Updated last year
- Shared Middle-Layer for Triton Compilation☆220Updated this week
- Collection of kernels written in Triton language☆90Updated 2 months ago
- Fastest kernels written from scratch☆118Updated last month
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆187Updated 3 months ago
- The Triton backend for the ONNX Runtime.☆136Updated this week
- Experimental projects related to TensorRT☆86Updated this week
- ☆20Updated 2 months ago
- ☆33Updated this week
- An experimental CPU backend for Triton☆75Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated this week
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆103Updated last month
- MLIR-based partitioning system☆56Updated this week