microsoft / onnxruntime-tvmLinks
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆34Updated 2 years ago
Alternatives and similar repositories for onnxruntime-tvm
Users that are interested in onnxruntime-tvm are comparing it to the libraries listed below
Sorting:
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…☆24Updated 2 months ago
- ☆111Updated last week
- AMD's graph optimization engine.☆228Updated this week
- The Triton backend for the ONNX Runtime.☆155Updated last week
- ☆69Updated 2 years ago
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆70Updated 6 years ago
- Notes and artifacts from the ONNX steering committee☆26Updated last week
- ☆310Updated 6 months ago
- Common utilities for ONNX converters☆274Updated 2 weeks ago
- Fast sparse deep learning on CPUs☆53Updated 2 years ago
- TensorFlow and TVM integration☆37Updated 5 years ago
- heterogeneity-aware-lowering-and-optimization☆255Updated last year
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆56Updated this week
- ☆161Updated last week
- ☆68Updated 2 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- ☆13Updated 5 years ago
- Computation using data flow graphs for scalable machine learning☆68Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆100Updated last week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆133Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 4 months ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 6 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Common source, scripts and utilities shared across all Triton repositories.☆74Updated last week
- A GPU-driven system framework for scalable AI applications☆117Updated 5 months ago
- The core library and APIs implementing the Triton Inference Server.☆138Updated this week
- ☆124Updated last year
- ☆74Updated 3 months ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆251Updated 2 weeks ago
- Symbolic Expression and Statement Module for new DSLs☆205Updated 4 years ago