bytedance/byteir

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/byteir)

bytedance / byteir

A model compilation solution for various hardware

☆473

Alternatives and similar repositories for byteir

Users that are interested in byteir are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MegEngine / MegCC
View on GitHub
MegCC是一个运行时超轻量，高效，移植简单的深度学习模型编译器
☆482Oct 23, 2024Updated last year
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
buddy-compiler / buddy-mlir
View on GitHub
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆742Updated this week
tensorflow / mlir-hlo
View on GitHub
☆421Feb 24, 2026Updated 4 months ago
llvm / torch-mlir
View on GitHub
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,871Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆340Dec 5, 2025Updated 7 months ago
onnx / onnx-mlir
View on GitHub
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆1,037Updated this week
iree-org / iree
View on GitHub
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,850Updated this week
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
KnowingNothing / MatmulTutorial
View on GitHub
A Easy-to-understand TensorOp Matmul Tutorial
☆445Mar 5, 2026Updated 4 months ago
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,495Updated this week
bytedance / flux
View on GitHub
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,344Aug 28, 2025Updated 10 months ago
merrymercy / awesome-tensor-compilers
View on GitHub
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,767Oct 19, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BBuf / tvm_mlir_learn
View on GitHub
compiler learning resources collect.
☆2,759May 20, 2026Updated 2 months ago
bytedance / xpu-perf
View on GitHub
[HPCA 2026] AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of…
☆369Apr 22, 2026Updated 3 months ago
bytedance / ByteTransformer
View on GitHub
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Mar 15, 2024Updated 2 years ago
volcengine / veScale
View on GitHub
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆1,031Mar 3, 2026Updated 4 months ago
tlc-pack / libflash_attn
View on GitHub
Standalone Flash Attention v2 kernel without libtorch dependency
☆113Sep 10, 2024Updated last year
sophgo / tpu-mlir
View on GitHub
Machine learning compiler based on MLIR for Sophgo TPU.
☆949Updated this week
j2kun / mlir-tutorial
View on GitHub
MLIR For Beginners tutorial
☆1,329Jul 18, 2025Updated last year
spcl / pymlir
View on GitHub
Python interface for MLIR - the Multi-Level Intermediate Representation
☆271Nov 28, 2024Updated last year
yester31 / Cutlass_EX
View on GitHub
study of cutlass
☆22Nov 10, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
openxla / stablehlo
View on GitHub
Backward compatible ML compute opset inspired by HLO/MHLO
☆674Updated this week
iree-org / iree-turbine
View on GitHub
IREE's PyTorch Frontend, based on Torch Dynamo.
☆109Updated this week
KnowingNothing / compiler-and-arch
View on GitHub
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
☆531Jan 15, 2025Updated last year
KEKE046 / mlir-tutorial
View on GitHub
Hands-On Practical MLIR Tutorial
☆812Oct 20, 2023Updated 2 years ago
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year
iree-org / iree-nvgpu
View on GitHub
☆48Mar 5, 2024Updated 2 years ago
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated last month
MLIR-China / mlir-playground
View on GitHub
Play with MLIR right in your browser
☆140May 25, 2023Updated 3 years ago
tpoisonooo / how-to-optimize-gemm
View on GitHub
row-major matmul optimization
☆743May 14, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bytedance / matxscript
View on GitHub
A high-performance, extensible Python AOT compiler.
☆449Sep 26, 2023Updated 2 years ago
llvm / eudsl
View on GitHub
Embedded Universal DSL: a good DSL for us, by us
☆76Updated this week
flame / how-to-optimize-gemm
View on GitHub
☆2,021Jul 29, 2023Updated 2 years ago
llvm / Polygeist
View on GitHub
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
☆623Jun 19, 2025Updated last year
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆19,746Updated this week
triton-lang / Triton-to-tile-IR
View on GitHub
incubator repo for CUDA-TileIR backend
☆148Jul 10, 2026Updated last week
microsoft / SparTA
View on GitHub
☆167Jul 22, 2024Updated 2 years ago