mediatechlab / tts-wrapper
TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
☆21Updated 9 months ago
Alternatives and similar repositories for tts-wrapper:
Users that are interested in tts-wrapper are comparing it to the libraries listed below
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆20Updated 2 weeks ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- web based editor for subtitles and transcripts☆130Updated 8 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!)☆23Updated 5 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- Simple Python audio transcriber using OpenAI's Whisper speech recognition model☆34Updated last month
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆45Updated 3 weeks ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated this week
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆50Updated 7 months ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Wav2vec resources and models for Brazilian Portuguese☆33Updated 2 years ago
- Installs FFMPEG v5 On Win32/Ubuntu/MacOS☆62Updated 3 weeks ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆28Updated 10 months ago
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Auto Generate Subtitle File For Any Type Of Audio and Video. Using Python and Google Speech-to-Text API.☆14Updated 4 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆53Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 2 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Automatically generate a music video by extracting scenes from another video☆31Updated last year
- ☆14Updated 2 years ago
- ☆18Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago