TartuNLP / text-to-speech-api
REST API for neural text-to-speech synthesis
☆15 Updated 2 years ago
Alternatives and similar repositories for text-to-speech-api:
Users who are interested in text-to-speech-api are comparing it to the libraries listed below.
- ☆53 Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization ☆25 Updated 2 years ago
- ☆13 Updated 8 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- Implement SSML parsing for Web Speech API ☆36 Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆15 Updated 4 months ago
- A simple voice conversion tool ☆17 Updated 3 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech ☆16 Updated 2 months ago
- A simple system for 2-way interruptible voice interactions between human and LLM ☆28 Updated last year
- ☆14 Updated last year
- ☆17 Updated 2 years ago
- The Vokan Architecture (Tsukasa speech based) ☆9 Updated 2 months ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆18 Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 Updated 4 years ago
- Collection of scripts from mHuBERT-147. ☆24 Updated 5 months ago
- ☆26 Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated last year
- ☆11 Updated 2 months ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large… ☆20 Updated 11 months ago
- Official PyTorch implementation of TTS Style Transfer ☆23 Updated 2 years ago
- Diffusion Model for Voice Conversion ☆17 Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆38 Updated last week
- ☆79 Updated 11 months ago
- 'Grad-TTS' with Multilingual Cleaners ☆10 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 2 weeks ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- ☆8 Updated 8 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆24 Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆36 Updated 2 years ago