biemster / gtts
Google Chrome Text to Speech command line client
☆34Updated 3 years ago
Alternatives and similar repositories for gtts
Users who are interested in gtts are comparing it to the libraries listed below.
- Google Chrome SODA Offline Speech Recognition command line client☆157Updated 3 months ago
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- This project aims to research Google's offline speech recognition from several Android apps and ideally make them interoperable by repli…☆66Updated 5 years ago
- An even smaller speech recognizer / force aligner☆32Updated 5 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- C++ library for converting text to phonemes for Piper☆118Updated last year
- Coqui Inference Engine☆40Updated 3 years ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆214Updated last week
- On-device noise suppression powered by deep learning☆69Updated last week
- ☆36Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆119Updated 5 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆46Updated 2 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆26Updated last month
- SEPIA server to support open-source speech recognition via WebSocket connection.☆126Updated 6 months ago
- Port of Meta's Encodec in C/C++☆219Updated 5 months ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆87Updated 4 months ago
- On-device speaker diarization powered by deep learning☆45Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- A recursive forced aligner built on Gentle.☆16Updated 6 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- ☆37Updated last year
- speaker diarization system using an LSTM☆50Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆65Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago