TartuNLP / text-to-speech-apiLinks
REST API for neural text-to-speech synthesis
☆16Updated 2 years ago
Alternatives and similar repositories for text-to-speech-api
Users that are interested in text-to-speech-api are comparing it to the libraries listed below
Sorting:
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
 - Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
 - A composition of offline tools to achieve high quality multilingual speech to text transcription☆22Updated 2 months ago
 - Simple, Unified Repository for Retrieval-based Voice Conversion☆17Updated last year
 - A Python neural network made with TensorFlow that converts one person's voice into another.☆10Updated 4 years ago
 - KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
 - ☆13Updated 2 years ago
 - Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro…☆11Updated 9 months ago
 - Sample and Computation Redistribution for Efficient Face Detection☆14Updated last year
 - ☆14Updated 2 years ago
 - SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆14Updated last year
 - A simple voice conversion tool☆19Updated 3 years ago
 - Indic-Conformer models for ASR☆18Updated last year
 - Zero-Shot Foreign Accent Conversion without a Native Reference☆35Updated last year
 - Diffusion Model for Voice Conversion☆17Updated 3 years ago
 - A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 3 years ago
 - ☆14Updated 2 years ago
 - Run `npm i -g socrate` to install a discussion room for using GPT personalities with internal monologues to debate problems. Provide a pr…☆28Updated 2 years ago
 - A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 11 months ago
 - [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆25Updated last year
 - SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
 - ☆13Updated 4 years ago
 - ☆10Updated 2 years ago
 - ☆14Updated last year
 - ☆19Updated 7 months ago
 - ☆10Updated last year
 - A set of tools for working with accent data in Mozilla's Common Voice dataset☆14Updated 2 years ago
 - Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆17Updated 2 years ago
 - SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated 2 years ago
 - Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago