dongbeiyewu / xlaLinks
☆22Updated 5 years ago
Alternatives and similar repositories for xla
Users that are interested in xla are comparing it to the libraries listed below
Sorting:
- A model compilation solution for various hardware☆443Updated this week
- Triton Compiler related materials.☆31Updated 7 months ago
- Machine Learning Compiler Road Map☆43Updated last year
- Development repository for the Triton-Linalg conversion☆193Updated 6 months ago
- examples for tvm schedule API☆101Updated 2 years ago
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Updated last month
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆165Updated this week
- A home for the final text of all TVM RFCs.☆105Updated 11 months ago
- ☆70Updated 2 years ago
- ☆196Updated 2 years ago
- Hands-On Practical MLIR Tutorial☆570Updated last year
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆617Updated last week
- code reading for tvm☆76Updated 3 years ago
- ☆24Updated 4 years ago
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆95Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆268Updated last week
- Yinghan's Code Sample☆344Updated 3 years ago
- ☆249Updated 2 weeks ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆60Updated last year
- ☆28Updated last year
- ☆105Updated 4 months ago
- ☆419Updated last week
- FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.☆72Updated this week
- Start AI Compiler☆41Updated 2 years ago
- This is a tutorial to learn LLVM, I realize a backend to compiler machine code for cpu0 which is a simple RISC cpu.☆251Updated 3 years ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆84Updated 2 years ago
- ☆135Updated 3 months ago
- ☆42Updated last month
- A simple high performance CUDA GEMM implementation.☆394Updated last year
- CUDA PTX-ISA Document 中文翻译版☆44Updated 2 months ago