onnx / ir-pyLinks
Efficient in-memory representation for ONNX, in Python
☆41Updated this week
Alternatives and similar repositories for ir-py
Users that are interested in ir-py are comparing it to the libraries listed below
Sorting:
- TORCH_TRACE parser for PT2☆72Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆327Updated 3 weeks ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆48Updated 5 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆420Updated this week
- Visualize ONNX models with model-explorer☆67Updated 3 weeks ago
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆189Updated this week
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆173Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆375Updated this week
- High-Performance FP32 GEMM on CUDA devices☆117Updated last year
- Ahead of Time (AOT) Triton Math Library☆88Updated this week
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆85Updated this week
- MLIR-based partitioning system☆162Updated this week
- Memory Optimizations for Deep Learning (ICML 2023)☆114Updated last year
- Model compression for ONNX☆99Updated last year
- ☆135Updated last week
- Common utilities for ONNX converters☆292Updated last month
- High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.☆127Updated last year
- python package of rocm-smi-lib☆24Updated last month
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆104Updated last month
- ☆344Updated 3 weeks ago
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆440Updated last month
- ☆59Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- Explore training for quantized models☆26Updated 6 months ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆104Updated this week
- A safetensors extension to efficiently store sparse quantized tensors on disk☆237Updated this week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆130Updated last month
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆105Updated 7 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆732Updated this week