☆54Mar 15, 2025Updated 11 months ago
Alternatives and similar repositories for torch_mlu
Users that are interested in torch_mlu are comparing it to the libraries listed below
Sorting:
- Development repository for the Triton-Linalg conversion☆215Feb 7, 2025Updated last year
- ☆14Nov 28, 2023Updated 2 years ago
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .☆155Feb 27, 2026Updated last week
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- ☆33Apr 20, 2023Updated 2 years ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch☆491Updated this week
- ☆22Dec 7, 2023Updated 2 years ago
- Development repository for the Triton language and compiler☆141Updated this week
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆43Feb 8, 2026Updated 3 weeks ago
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 3 months ago
- ☆21Mar 22, 2021Updated 4 years ago
- ☆104Sep 9, 2024Updated last year
- Optimize softmax in triton in many cases☆23Sep 6, 2024Updated last year
- FlagCX is a scalable and adaptive cross-chip communication library.☆174Feb 27, 2026Updated last week
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- Nex Venus Communication Library☆72Nov 17, 2025Updated 3 months ago
- ☆74Updated this week
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆101Aug 25, 2025Updated 6 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 5 months ago
- ☆57Feb 2, 2026Updated last month
- Triton Compiler related materials.☆42Jan 4, 2025Updated last year
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- All Resources from Stanford CS106B 2021☆24Jul 11, 2025Updated 7 months ago
- extensible collectives library in triton☆96Mar 31, 2025Updated 11 months ago
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.☆41Feb 4, 2026Updated last month
- Triton Documentation in Chinese Simplified / Triton 中文文档☆105Dec 17, 2025Updated 2 months ago
- ☆169Updated this week
- ☆262Jul 11, 2024Updated last year
- Framework to reduce autotune overhead to zero for well known deployments.☆97Sep 19, 2025Updated 5 months ago
- Protocol buffers and other common resources.☆13Updated this week
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Pipeline for analyzing rare mutations in metagenome-assembled genomes☆10Apr 4, 2025Updated 11 months ago
- ☆13Dec 3, 2024Updated last year
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- libFastMesh - Optimized Finite Volume Computational Aeroacoustics (CAA) Code☆13Mar 28, 2024Updated last year
- ☆14Jan 23, 2026Updated last month