TartuNLP / text-to-speech-apiLinks
REST API for neural text-to-speech synthesis
☆16Updated 2 years ago
Alternatives and similar repositories for text-to-speech-api
Users that are interested in text-to-speech-api are comparing it to the libraries listed below
Sorting:
- Simple, Unified Repository for Retrieval-based Voice Conversion☆17Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated this week
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 2 years ago
- Cantonese Selfish Project 廣東話自肥企劃 at PYCON HK 2021☆15Updated 3 years ago
- Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro…☆11Updated 7 months ago
- ☆54Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- ☆13Updated 2 years ago
- A set of tools for working with accent data in Mozilla's Common Voice dataset☆13Updated last year
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 5 months ago
- A simple voice conversion tool☆18Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆16Updated 9 months ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 11 months ago
- A Python neural network made with TensorFlow that converts one person's voice into another.☆10Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Run `npm i -g socrate` to install a discussion room for using GPT personalities with internal monologues to debate problems. Provide a pr…☆28Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Ready-to-use Multilingual Text-To-Speech (TTS) package.☆23Updated 2 years ago
- ☆28Updated 4 years ago
- Indic-Conformer models for ASR☆18Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Deep metric learning: Triplet, Magnet and VMF loss☆11Updated 3 years ago
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆14Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆14Updated 9 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆34Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆43Updated last year
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.☆10Updated this week