TartuNLP / text-to-speech-apiLinks
REST API for neural text-to-speech synthesis
☆17Updated 2 weeks ago
Alternatives and similar repositories for text-to-speech-api
Users that are interested in text-to-speech-api are comparing it to the libraries listed below
Sorting:
- Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro…☆11Updated 10 months ago
- Simple, Unified Repository for Retrieval-based Voice Conversion☆17Updated last year
- ☆13Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- open-webui-runpod-integration☆14Updated 10 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated 2 months ago
- A simple voice conversion tool☆19Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- ☆14Updated 2 years ago
- Indic-Conformer models for ASR☆18Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆35Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆20Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 11 months ago
- A Python neural network made with TensorFlow that converts one person's voice into another.☆10Updated 4 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- ☆56Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 11 months ago
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Updated last year
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 9 months ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 3 years ago
- ☆14Updated last year
- ☆13Updated 4 years ago
- Sample and Computation Redistribution for Efficient Face Detection☆15Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Run `npm i -g socrate` to install a discussion room for using GPT personalities with internal monologues to debate problems. Provide a pr…☆28Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- ☆19Updated 8 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year