Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality multilingual speech synthesis.
☆67 Apr 8, 2026 Updated this week
Alternatives and similar repositories for Qwen3-TTS-EasyFinetuning
Users who are interested in Qwen3-TTS-EasyFinetuning are comparing it to the libraries listed below.
- Testing sets for semanticVAD ☆20 Feb 18, 2025 Updated last year
- ☆12 Mar 11, 2025 Updated last year
- FoF Upload, but with TencentCloud COS ☆14 Nov 10, 2024 Updated last year
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models ☆30 Feb 13, 2026 Updated last month
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation ☆11 Mar 7, 2023 Updated 3 years ago
- ComfyUI custom nodes for LongCat-AudioDiT \ Diffusion-based Zero-Shot Text-to-Speech ☆84 Apr 4, 2026 Updated last week
- ☆12 Feb 16, 2026 Updated last month
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Apr 3, 2026 Updated last week
- Convert audio recordings of drums into MIDI files with Hidden Markov Models. ☆11 Jul 19, 2016 Updated 9 years ago
- Official code for SongEcho ☆55 Mar 3, 2026 Updated last month
- Template for creating audio encoders compatible with X-ARES ☆19 Feb 11, 2026 Updated 2 months ago
- A streaming audio reader, processor, and writer built on top of soundfile and PyAV (bindings for FFmpeg) ☆38 Mar 31, 2026 Updated last week
- Official code for paper: "Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding" ☆36 Jan 28, 2026 Updated 2 months ago
- Free ACELP vocoder ☆17 Sep 20, 2024 Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS ☆24 Sep 9, 2024 Updated last year
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or… ☆17 Jul 31, 2025 Updated 8 months ago
- ☆34 Sep 15, 2025 Updated 6 months ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset ☆21 Jun 6, 2025 Updated 10 months ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders" ☆14 Sep 20, 2024 Updated last year
- Training and inference code for a reproduction of Google's speech restoration model Miipher-2. Pretrained models are also released. ☆31 Feb 7, 2026 Updated 2 months ago
- Simple cloth simulation for Three.js WebGPU ☆38 Feb 20, 2026 Updated last month
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Oct 19, 2022 Updated 3 years ago
- MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection ☆29 May 29, 2025 Updated 10 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis ☆38 Dec 24, 2025 Updated 3 months ago
- The implementation of g2pL with a new open dataset. ☆16 May 14, 2023 Updated 2 years ago
- SoTA open-source TTS ☆137 Jun 7, 2025 Updated 10 months ago
- ☆47 Jul 7, 2025 Updated 9 months ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆24 Feb 1, 2026 Updated 2 months ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN ☆24 Jul 4, 2022 Updated 3 years ago
- Pattern of Resume. ☆17 Aug 6, 2017 Updated 8 years ago
- Collection of scripts from mHuBERT-147. ☆34 Nov 19, 2024 Updated last year
- From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning - CVPR 2025 ☆16 Mar 24, 2025 Updated last year
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting. ☆141 Aug 13, 2025 Updated 7 months ago
- A neural speech codec based on discrete WavLM representations ☆26 Aug 28, 2024 Updated last year
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning" ☆62 Jan 28, 2026 Updated 2 months ago
- ☆40 Jul 15, 2025 Updated 8 months ago
- ☆59 Dec 24, 2025 Updated 3 months ago
- Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline… ☆95 Feb 20, 2026 Updated last month
- ☆47 Aug 28, 2025 Updated 7 months ago