A minimalist and extensible PyTorch extension for implementing custom backend operators in PyTorch.
☆41Jan 24, 2026Updated 5 months ago
Alternatives and similar repositories for My-Torch-Extension
Users that are interested in My-Torch-Extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A practical way of learning Swizzle☆42Feb 3, 2025Updated last year
- ☆49Apr 15, 2024Updated 2 years ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 5 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 9 months ago
- llama 2 Inference☆43Nov 4, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for "An Introduction to Tensor Tiling in MLIR" tutorial given at EuroLLVM 2025☆23Jun 5, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆19Feb 2, 2023Updated 3 years ago
- A CUDA tutorial to make people learn CUDA program from 0☆279Jul 9, 2024Updated last year
- Anderson points-to analysis implementation based on LLVM☆12Jan 3, 2021Updated 5 years ago
- 个人学习编译原理、理解创造一个编译器主体流程的小项目☆10Oct 7, 2020Updated 5 years ago
- Triton to TVM transpiler.☆23Oct 14, 2024Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆16Oct 27, 2023Updated 2 years ago
- Vocal Remover using Deep Neural Networks☆21Dec 31, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Mutiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 8 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆43Dec 29, 2025Updated 6 months ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated 2 years ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆26Oct 23, 2024Updated last year
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆47Jun 11, 2025Updated last year
- Voice conversion with just linear regression.☆37Sep 25, 2025Updated 9 months ago
- how to optimize some algorithm in cuda.☆3,102Jun 24, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- ☆138Feb 4, 2026Updated 4 months ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 4 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆38Aug 11, 2025Updated 10 months ago
- Official repository Flash Local Linear Attention☆37May 28, 2026Updated last month
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- 数据库内核笔记☆14Aug 18, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- pytorch 大规模数据读取dataset☆13May 30, 2022Updated 4 years ago
- UESTC-《Parallel and Distributed Computing》Course Experiment(电子科技大学 《分布式并行计算》课程实验)-Nvidia CUDA Course on https://courses.nvidia.com/course…☆10Jun 18, 2019Updated 7 years ago
- Open-Pandora: On-the-fly Control Video Generation☆35Nov 28, 2024Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated 2 years ago
- CUDA Matrix Multiplication Optimization☆275Jul 19, 2024Updated last year
- Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy☆60Jun 18, 2026Updated last week
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆21Aug 23, 2025Updated 10 months ago