bytedance/matxscript

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/matxscript)

bytedance / matxscript

A high-performance, extensible Python AOT compiler.

☆449

Alternatives and similar repositories for matxscript

Users that are interested in matxscript are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bytedance / ByteTransformer
View on GitHub
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Mar 15, 2024Updated 2 years ago
UofT-EcoSystem / DietCode
View on GitHub
DietCode Code Release
☆65Jul 21, 2022Updated 3 years ago
bytedance / byteir
View on GitHub
A model compilation solution for various hardware
☆473Aug 20, 2025Updated 11 months ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tlc-pack / relax
View on GitHub
☆193Mar 28, 2023Updated 3 years ago
bytedance / effective_transformer
View on GitHub
Running BERT without Padding
☆479Mar 18, 2022Updated 4 years ago
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
awslabs / lorien
View on GitHub
☆42Sep 8, 2023Updated 2 years ago
comaniac / epoi
View on GitHub
Benchmark PyTorch Custom Operators
☆14Jul 6, 2023Updated 3 years ago
octoml / octoml-profile
View on GitHub
Home for OctoML PyTorch Profiler
☆114Apr 24, 2023Updated 3 years ago
pytorch / torchdynamo
View on GitHub
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,078Apr 17, 2024Updated 2 years ago
octoml / synr
View on GitHub
A library for syntactically rewriting Python programs, pronounced (sinner).
☆66Feb 22, 2022Updated 4 years ago
bytedance / byteps
View on GitHub
A high performance and generic framework for distributed DNN training
☆3,717Oct 3, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / tvm-rfcs
View on GitHub
A home for the final text of all TVM RFCs.
☆111Sep 24, 2024Updated last year
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 10 months ago
CVCUDA / CV-CUDA
View on GitHub
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
☆2,709May 28, 2026Updated last month
tensorflow / mlir-hlo
View on GitHub
☆421Feb 24, 2026Updated 4 months ago
merrymercy / awesome-tensor-compilers
View on GitHub
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,766Oct 19, 2024Updated last year
jiazhihao / TASO
View on GitHub
The Tensor Algebra SuperOptimizer for Deep Learning
☆742Jan 26, 2023Updated 3 years ago
chhzh123 / ptc-tutorial
View on GitHub
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Mar 13, 2023Updated 3 years ago
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated 3 weeks ago
awslabs / ratex
View on GitHub
☆23Aug 21, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Tencent / KsanaLLM
View on GitHub
☆544Updated this week
OpenPPL / ppl.nn
View on GitHub
A primitive library for neural network
☆1,367Nov 24, 2024Updated last year
bytedance / lightseq
View on GitHub
LightSeq: A High Performance Library for Sequence Processing and Generation
☆3,296May 16, 2023Updated 3 years ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,494Updated this week
NVIDIA-Merlin / HierarchicalKV
View on GitHub
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆208May 22, 2026Updated last month
flexflow / flexflow-train
View on GitHub
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
☆1,896Jul 1, 2026Updated 2 weeks ago
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,442Mar 27, 2024Updated 2 years ago
facebookresearch / fairring
View on GitHub
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆66Mar 21, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bytedance / flux
View on GitHub
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,344Aug 28, 2025Updated 10 months ago
jundaf2 / INT8-Flash-Attention-FMHA-Quantization
View on GitHub
☆165Sep 15, 2023Updated 2 years ago
alibaba / heterogeneity-aware-lowering-and-optimization
View on GitHub
heterogeneity-aware-lowering-and-optimization
☆259Jan 20, 2024Updated 2 years ago
roastduck / FreeTensor
View on GitHub
A language and compiler for irregular tensor programs.
☆152Updated this week
hidet-org / hidet
View on GitHub
An open-source efficient deep learning framework/compiler, written in python.
☆743Sep 4, 2025Updated 10 months ago
BaguaSys / bagua-net
View on GitHub
High performance NCCL plugin for Bagua.
☆15Sep 15, 2021Updated 4 years ago
microsoft / SparTA
View on GitHub
☆167Jul 22, 2024Updated last year