microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆270 Updated this week
Related projects:
- Common utilities for ONNX converters ☆245 Updated 2 months ago
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime ☆319 Updated this week
- Accelerate PyTorch models with ONNX Runtime ☆353 Updated 2 weeks ago
- The Triton backend for the ONNX Runtime. ☆122 Updated last week
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, sparsity, distillat… ☆434 Updated last week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i… ☆173 Updated 2 months ago
- Actively maintained ONNX Optimizer ☆634 Updated 6 months ago
- Examples for using ONNX Runtime for model training. ☆301 Updated last month
- Scailable ONNX Python tools ☆96 Updated 9 months ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆994 Updated 5 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆380 Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆176 Updated last week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆419 Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆250 Updated this week
- Generative AI extensions for onnxruntime ☆421 Updated this week
- A library to analyze PyTorch traces. ☆270 Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU) ☆144 Updated this week
- The Triton backend for the PyTorch TorchScript models. ☆117 Updated last week
- A code generator from ONNX to PyTorch code ☆132 Updated last year
- An open-source efficient deep learning framework/compiler, written in Python. ☆646 Updated 3 weeks ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆742 Updated this week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments. ☆715 Updated last month
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆324 Updated last week
- A PyTorch quantization backend for Optimum ☆758 Updated this week
- PyTorch RFCs (experimental) ☆120 Updated 3 weeks ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆144 Updated last month
- PyTorch native quantization and sparsity for training and inference ☆726 Updated this week
- Common source, scripts and utilities for creating Triton backends. ☆280 Updated last week
- Pipeline Parallelism for PyTorch ☆708 Updated 3 weeks ago
- Backward compatible ML compute opset inspired by HLO/MHLO ☆380 Updated last week