dongbeiyewu / xlaLinks
☆22Updated 4 years ago
Alternatives and similar repositories for xla
Users that are interested in xla are comparing it to the libraries listed below
Sorting:
- A model compilation solution for various hardware☆438Updated 3 weeks ago
- Machine Learning Compiler Road Map☆43Updated last year
- Triton Compiler related materials.☆30Updated 6 months ago
- ☆23Updated 4 years ago
- Development repository for the Triton-Linalg conversion☆189Updated 5 months ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆156Updated 2 weeks ago
- ☆195Updated 2 years ago
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆44Updated last week
- A home for the final text of all TVM RFCs.☆105Updated 9 months ago
- examples for tvm schedule API☆101Updated 2 years ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆59Updated last year
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆94Updated 2 years ago
- code reading for tvm☆76Updated 3 years ago
- ☆237Updated 3 weeks ago
- Hands-On Practical MLIR Tutorial☆521Updated last year
- ☆70Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆258Updated this week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆607Updated last month
- Yinghan's Code Sample☆337Updated 2 years ago
- Benchmark Framework for Buddy Projects☆54Updated last month
- ☆123Updated 2 months ago
- ☆32Updated 2 years ago
- A simple high performance CUDA GEMM implementation.☆384Updated last year
- tensorflow源码阅读笔记☆191Updated 6 years ago
- Start AI Compiler☆39Updated 2 years ago
- ☆28Updated last year
- ☆148Updated 6 months ago
- CUDA PTX-ISA Document 中文翻译版☆43Updated last month
- ☆36Updated 6 months ago
- Some source code about matrix multiplication implementation on CUDA☆34Updated 6 years ago