Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual
☆85Jun 7, 2026Updated last week
Alternatives and similar repositories for qwen3-tts-triton
Users that are interested in qwen3-tts-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆36Oct 23, 2025Updated 7 months ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!☆32Jun 6, 2020Updated 6 years ago
- Text-to-text alignment algorithm for speech recognition error analysis.☆30Apr 6, 2026Updated 2 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- 一个将豆包 ASR 能力封装为 OpenAI 兼容接口的小项目,支持 Docker 启动,并提供一份可配合 Spokenly 使用的参考修正提示词,实现和 Typeless 类似的语音修正效果。☆40Feb 28, 2026Updated 3 months ago
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆154Sep 20, 2024Updated last year
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated 4 months ago
- ☆14Jan 2, 2025Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 3 years ago
- C++ neural network library☆13Jul 2, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Jun 4, 2016Updated 10 years ago
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆96May 25, 2023Updated 3 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated last month
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- ☆13Apr 10, 2020Updated 6 years ago
- ☆25Jun 2, 2026Updated 2 weeks ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Fine-tuning Moshi/J-Moshi on your own spoken dialogue data☆99Jan 5, 2026Updated 5 months ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆18Jun 25, 2021Updated 4 years ago
- Weird autoencoder experiments☆25May 20, 2026Updated 3 weeks ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 5 months ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 5 years ago
- Exquisite video generation☆15Feb 18, 2024Updated 2 years ago
- Solves the longest common subsequence problem in Python☆19Dec 16, 2011Updated 14 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- The official implementation of AAAI2024 paper of "Scribble Hides Class: Promoting Scribble-based Semantic Segmentation with its Class Lab…☆17Oct 10, 2024Updated last year
- Github page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Oct 23, 2020Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆24Dec 12, 2019Updated 6 years ago
- AudioLDM text to audio colab☆18Nov 6, 2023Updated 2 years ago
- Pre-trained models for ISMIR 2019 Paper Large-Vocabulary Chord Transcription via Chord Structure Decomposition☆62Apr 9, 2024Updated 2 years ago
- VALL-E 한국어 버전☆12Aug 22, 2023Updated 2 years ago