Ascend/pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ascend/pytorch)

Ascend / pytorch

Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch

☆553

Alternatives and similar repositories for pytorch

Users that are interested in pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Ascend / torchair
View on GitHub
☆26Jun 8, 2026Updated last month
Ascend / triton-ascend
View on GitHub
Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend
☆127May 18, 2026Updated 2 months ago
Cambricon / torch_mlu
View on GitHub
☆57Mar 15, 2025Updated last year
bdhirsh / pytorch_open_registration_example
View on GitHub
Example of using pytorch's open device registration API
☆31Oct 14, 2022Updated 3 years ago
vllm-project / vllm-ascend
View on GitHub
Community maintained hardware plugin for vLLM on Ascend
☆2,478Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Ascend / AscendSpeed
View on GitHub
☆79Dec 15, 2023Updated 2 years ago
tile-ai / tilelang-ascend
View on GitHub
Ascend TileLang adapter
☆338Updated this week
cosdt / vllm-ascend
View on GitHub
See vLLM official support: https://github.com/vllm-project/vllm-ascend
☆11Feb 5, 2025Updated last year
BrightXiaoHan / optimum-ascend
View on GitHub
Optimized inference with Ascend and Hugging Face
☆12Apr 23, 2024Updated 2 years ago
flagos-ai / FlagGems
View on GitHub
FlagGems is an operator library for large language models implemented in the Triton Language.
☆1,057Updated this week
NVIDIA / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆17,212Updated this week
Cambricon / catch
View on GitHub
☆33Apr 20, 2023Updated 3 years ago
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,531Updated this week
mindspore-ai / mindspore
View on GitHub
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
☆4,697Jul 29, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ModelTC / LightLLM
View on GitHub
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…
☆4,193Updated this week
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆5,999Updated this week
DeepLink-org / ditorch
View on GitHub
☆31Jan 7, 2025Updated last year
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
NVIDIA / TransformerEngine
View on GitHub
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,448Updated this week
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆19,782Updated this week
openucx / ucc
View on GitHub
Unified Collective Communication Library
☆311Jul 17, 2026Updated last week
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆6,032Updated this week
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,733Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,444Mar 27, 2024Updated 2 years ago
deepspeedai / DeepSpeed-Kernels
View on GitHub
☆76Mar 26, 2025Updated last year
vllm-project / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆133Updated this week
NVIDIA / nccl
View on GitHub
Optimized primitives for collective multi-GPU communication
☆4,905Updated this week
NVIDIA / cutlass
View on GitHub
CUDA Templates and Python DSLs for High-Performance Linear Algebra
☆10,125Updated this week
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
DeepLink-org / DLSlime
View on GitHub
Composable and Embeddable Communication Runtime for Distributed AI Services
☆102Jun 5, 2026Updated last month
DeepLink-org / DIOPI
View on GitHub
☆76Nov 22, 2024Updated last year
NVIDIA / TensorRT-LLM
View on GitHub
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…
☆14,205Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tile-ai / tilelang
View on GitHub
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
☆6,908Updated this week
LMCache / LMCache-Ascend
View on GitHub
LMCache on Ascend
☆82Updated this week
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,414Updated this week
sgl-project / sgl-kernel-npu
View on GitHub
SGLang kernel library for NPU
☆171Updated this week
InternLM / lmdeploy
View on GitHub
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
☆7,972Updated this week
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,138Updated this week
NVIDIA / nccl-tests
View on GitHub
NCCL Tests
☆1,603Jul 9, 2026Updated 2 weeks ago