dongbeiyewu / xlaLinks
☆22Updated 5 years ago
Alternatives and similar repositories for xla
Users that are interested in xla are comparing it to the libraries listed below
Sorting:
- A model compilation solution for various hardware☆446Updated 3 weeks ago
- Triton Compiler related materials.☆32Updated 8 months ago
- ☆195Updated 2 years ago
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆623Updated this week
- Machine Learning Compiler Road Map☆44Updated 2 years ago
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Updated last week
- Development repository for the Triton-Linalg conversion☆197Updated 7 months ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆169Updated last week
- examples for tvm schedule API☆101Updated 2 years ago
- Hands-On Practical MLIR Tutorial☆581Updated last year
- ☆24Updated 5 years ago
- ☆253Updated this week
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆95Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆285Updated last week
- A home for the final text of all TVM RFCs.☆106Updated 11 months ago
- This is a tutorial to learn LLVM, I realize a backend to compiler machine code for cpu0 which is a simple RISC cpu.☆253Updated 3 years ago
- ☆70Updated 2 years ago
- Yinghan's Code Sample☆345Updated 3 years ago
- CUDA PTX-ISA Document 中文翻译版☆44Updated 3 months ago
- code reading for tvm☆76Updated 3 years ago
- Benchmark Framework for Buddy Projects☆55Updated 2 months ago
- ☆28Updated last year
- 先进编译实验室的个人主页☆139Updated 4 months ago
- Play with MLIR right in your browser☆136Updated 2 years ago
- Start AI Compiler☆44Updated 2 years ago
- ☆153Updated 8 months ago
- A Easy-to-understand TensorOp Matmul Tutorial☆376Updated 11 months ago
- ☆139Updated 4 months ago
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.☆379Updated 8 months ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆63Updated last year