☆54Mar 15, 2025Updated last year
Alternatives and similar repositories for torch_mlu
Users that are interested in torch_mlu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 28, 2023Updated 2 years ago
- ☆13Sep 19, 2023Updated 2 years ago
- Development repository for the Triton-Linalg conversion☆218Feb 7, 2025Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch☆499Apr 9, 2026Updated last week
- ☆33Apr 20, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆57Feb 24, 2026Updated last month
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- Shared Middle-Layer for Triton Compilation☆330Dec 5, 2025Updated 4 months ago
- A PyTorch native platform for training generative AI models☆16Nov 18, 2025Updated 4 months ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication☆15Mar 25, 2024Updated 2 years ago
- Development repository for the Triton language and compiler☆144Updated this week
- ☆11Jun 14, 2024Updated last year
- FlagCX is a scalable and adaptive cross-chip communication library.☆184Apr 7, 2026Updated last week
- ☆13Jun 18, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆22Dec 7, 2023Updated 2 years ago
- Optimize softmax in triton in many cases☆23Sep 6, 2024Updated last year
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆13Jun 5, 2024Updated last year
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- ☆62Apr 3, 2026Updated last week
- ☆20Jun 13, 2025Updated 10 months ago
- a Tensorflow version of Faster Rcnn for ICPR2018 text detection☆13May 28, 2018Updated 7 years ago
- Torch Plasma Simulator☆10Apr 5, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- libFastMesh - Optimized Finite Volume Computational Aeroacoustics (CAA) Code☆13Mar 28, 2024Updated 2 years ago
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆20Mar 3, 2025Updated last year
- ☆105Sep 9, 2024Updated last year
- ☆46Jul 16, 2025Updated 9 months ago
- Opencv ARM Linux precompiled library☆14Apr 3, 2021Updated 5 years ago
- Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend☆119Updated this week
- Triton Documentation in Chinese Simplified / Triton 中文文档☆111Mar 5, 2026Updated last month
- TORCH_TRACE parser for PT2☆84Apr 9, 2026Updated last week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simple pseudo-spectral solver for the Direct Numerical Simulation (DNS) of the 3D Taylor-Green Vortex in the Julia programming language☆10Jun 6, 2022Updated 3 years ago
- A framework to compare low-bit integer and float-point formats☆72Feb 6, 2026Updated 2 months ago
- Gallatin is a general-purpose memory manager for CUDA that allows for threads to quickly malloc and free memory of arbitrary size inside …☆25Mar 27, 2026Updated 2 weeks ago
- A GPU-accelerated differentiable fluid simulator written in JAX.☆11Feb 1, 2021Updated 5 years ago
- ☆21Mar 22, 2021Updated 5 years ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- Simple intermediate representation language for learning and research.☆20Mar 27, 2020Updated 6 years ago