A minimalist and extensible PyTorch extension for implementing custom backend operators in PyTorch.
☆39 · Jan 24, 2026 · Updated last month
Alternatives and similar repositories for My-Torch-Extension
Users who are interested in My-Torch-Extension are comparing it to the libraries listed below.
- A practical way of learning Swizzle ☆37 · Feb 3, 2025 · Updated last year
- ☆49 · Apr 15, 2024 · Updated last year
- SGEMM optimization with cuda step by step ☆21 · Mar 23, 2024 · Updated last year
- Model acceleration / model compression (all labs completed) ☆11 · Dec 24, 2023 · Updated 2 years ago
- High-performance RMSNorm implemented using SM core storage (registers and shared memory) ☆29 · Jan 22, 2026 · Updated last month
- Xmixers: A collection of SOTA efficient token/channel mixers ☆28 · Sep 4, 2025 · Updated 6 months ago
- llama 2 Inference ☆42 · Nov 4, 2023 · Updated 2 years ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control ☆12 · Nov 11, 2023 · Updated 2 years ago
- A simplified flash-attention implementation using cutlass, intended for teaching ☆59 · Aug 12, 2024 · Updated last year
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- A CUDA tutorial for learning CUDA programming from scratch