Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual
☆79Apr 17, 2026Updated last month
Alternatives and similar repositories for qwen3-tts-triton
Users that are interested in qwen3-tts-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 7 months ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!☆32Jun 6, 2020Updated 5 years ago
- Text-to-text alignment algorithm for speech recognition error analysis.☆29Apr 6, 2026Updated last month
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- 一个将豆包 ASR 能力封装为 OpenAI 兼容接口的小项目,支持 Docker 启动,并提供一份可配合 Spokenly 使用的参考修正提示词,实现和 Typeless 类似的语音修正效果。☆39Feb 28, 2026Updated 2 months ago
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆154Sep 20, 2024Updated last year
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated 3 months ago
- ☆14Jan 2, 2025Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 3 years ago
- C++ neural network library☆13Jul 2, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Jun 4, 2016Updated 9 years ago
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆96May 25, 2023Updated 3 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated 3 weeks ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- ☆12Apr 10, 2020Updated 6 years ago
- ☆25Jul 30, 2025Updated 9 months ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Fine-tuning Moshi/J-Moshi on your own spoken dialogue data☆98Jan 5, 2026Updated 4 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆18Jun 25, 2021Updated 4 years ago
- Weird autoencoder experiments☆24Updated this week
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 5 years ago
- Exquisite video generation☆15Feb 18, 2024Updated 2 years ago
- Solves the longest common subsequence problem in Python☆19Dec 16, 2011Updated 14 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- The official implementation of AAAI2024 paper of "Scribble Hides Class: Promoting Scribble-based Semantic Segmentation with its Class Lab…☆17Oct 10, 2024Updated last year
- Github page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Oct 23, 2020Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆24Dec 12, 2019Updated 6 years ago
- AudioLDM text to audio colab☆18Nov 6, 2023Updated 2 years ago
- Pre-trained models for ISMIR 2019 Paper Large-Vocabulary Chord Transcription via Chord Structure Decomposition☆59Apr 9, 2024Updated 2 years ago
- VALL-E 한국어 버전☆12Aug 22, 2023Updated 2 years ago