biemster / gttsLinks
Google Chrome Text to Speech command line client
☆34Updated 4 years ago
Alternatives and similar repositories for gtts
Users that are interested in gtts are comparing it to the libraries listed below
Sorting:
- Google Chrome SODA Offline Speech Recognition command line client☆159Updated 5 months ago
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆67Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning☆220Updated last week
- Synchronize Whisper's timestamps over an existing accurate transcription☆153Updated last year
- On-device noise suppression powered by deep learning☆73Updated this week
- An even smaller speech recognizer / force aligner☆35Updated 7 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆48Updated 4 months ago
- C++ library for converting text to phonemes for Piper☆128Updated last week
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 8 months ago
- Coqui Inference Engine☆40Updated 3 years ago
- ez audio transcription tool with flexible processing and post-processing options☆155Updated last year
- 🐸STT integration examples☆129Updated 2 years ago
- ☆146Updated last year
- Port of Meta's Encodec in C/C++☆226Updated 7 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆78Updated 2 years ago
- Open models for Coqui STT☆141Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆332Updated 8 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated 10 months ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- On-device speaker diarization powered by deep learning☆51Updated this week
- ☆59Updated 5 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆37Updated last week
- ☆13Updated 2 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 7 months ago