justinchuby / model-explorer-onnx
ONNX Adapter for model-explorer
☆25Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for model-explorer-onnx
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆288Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- Shared Middle-Layer for Triton Compilation☆192Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆127Updated 3 weeks ago
- OpenAI Triton backend for Intel® GPUs☆143Updated this week
- High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.☆90Updated 4 months ago
- Experimental projects related to TensorRT☆81Updated this week
- ☆30Updated this week
- Home for OctoML PyTorch Profiler☆107Updated last year
- Applied AI experiments and examples for PyTorch☆168Updated 3 weeks ago
- An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).☆211Updated 3 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆271Updated this week
- ☆45Updated 2 weeks ago
- Development repository for the Triton language and compiler☆96Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆164Updated 2 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆100Updated 11 months ago
- Stores documents and resources used by the OpenXLA developer community☆107Updated 3 months ago
- Common utilities for ONNX converters☆252Updated 5 months ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆83Updated this week
- extensible collectives library in triton☆72Updated 2 months ago
- The Triton backend for the ONNX Runtime.☆133Updated this week
- Simple and fast low-bit matmul kernels in CUDA / Triton☆147Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆55Updated this week
- ☆12Updated last month
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆340Updated this week
- Standalone Flash Attention v2 kernel without libtorch dependency☆98Updated 2 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆153Updated this week
- An experimental CPU backend for Triton☆56Updated last week
- Benchmarks to capture important workloads.☆28Updated 5 months ago
- ☆153Updated this week