skit-ai / woc-tts-enhancement
This is a Winter of Code project aimed at speech enhancement for text-to-speech models.
☆24Feb 6, 2022Updated 4 years ago
Alternatives and similar repositories for woc-tts-enhancement
Users who are interested in woc-tts-enhancement are comparing it to the repositories listed below.
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- ☆10Sep 2, 2024Updated last year
- Baseline Kaldi script for the UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Chatterbox TTS + voice cloning using ONNX☆27Dec 31, 2025Updated last month
- ☆11Mar 22, 2023Updated 2 years ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated 2 weeks ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆14Jun 28, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- Indic-Conformer models for ASR☆20Jul 19, 2024Updated last year
- An evaluation toolkit for voice conversion models.☆42Jul 11, 2021Updated 4 years ago
- ☆14Aug 19, 2024Updated last year
- Contrastive Bayesian Analysis for Deep Metric Learning and an Integrated Deep Metric Learning Toolbox Based on PyTorch☆13Dec 27, 2022Updated 3 years ago
- ☆16Mar 25, 2025Updated 10 months ago
- ☆15Sep 10, 2023Updated 2 years ago
- End-to-end speech recognition implementation; includes LAS, CTC, and RNN-T decoding approaches, with models such as SA (MHA), LSTM, CNN, and DFSMN☆15Jun 4, 2021Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆18Aug 23, 2024Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- ☆29Nov 4, 2025Updated 3 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated 2 weeks ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆19May 14, 2025Updated 9 months ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022☆15Jun 18, 2022Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- Code for the paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- ☆21Mar 4, 2024Updated last year
- ☆19Jul 16, 2023Updated 2 years ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 8 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Jun 1, 2023Updated 2 years ago