biemster / gttsLinks
Google Chrome Text to Speech command line client
☆34Updated 3 years ago
Alternatives and similar repositories for gtts
Users that are interested in gtts are comparing it to the libraries listed below
Sorting:
- Google Chrome SODA Offline Speech Recognition command line client☆158Updated 4 months ago
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆67Updated 5 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆48Updated 3 months ago
- On-device speaker diarization powered by deep learning☆51Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- An even smaller speech recognizer / force aligner☆33Updated 6 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆41Updated 10 months ago
- On-device voice activity detection (VAD) powered by deep learning☆218Updated last week
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆124Updated 10 months ago
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- Snowboy reimplementation☆89Updated 3 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆99Updated 8 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆152Updated last year
- ☆13Updated last month
- Coqui Inference Engine☆40Updated 3 years ago
- ☆29Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆29Updated last year
- On-device noise suppression powered by deep learning☆73Updated this week
- Colab notebooks for Next-gen Kaldi☆27Updated 2 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆121Updated 6 months ago
- openvino version of openai/whisper☆167Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆165Updated 2 weeks ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆14Updated 7 months ago
- C++ library for converting text to phonemes for Piper☆122Updated last year