biemster / gtts
Google Chrome Text to Speech command line client
☆32Updated 3 years ago
Alternatives and similar repositories for gtts:
Users that are interested in gtts are comparing it to the libraries listed below
- Google Chrome SODA Offline Speech Recognition command line client☆154Updated 2 weeks ago
- Android offline speech recognition natively on PC☆50Updated 4 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆63Updated 4 years ago
- An even smaller speech recognizer / force aligner☆32Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- On-device noise suppression powered by deep learning☆66Updated this week
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆65Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆197Updated this week
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆142Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆38Updated 5 months ago
- Speaker Diarization with Transformers☆64Updated 8 months ago
- Port of Meta's Encodec in C/C++☆215Updated 2 months ago
- Snowboy reimplementation☆83Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆32Updated 4 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆92Updated 4 months ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Faster Whisper ASR transcription with CTranslate2☆19Updated 3 months ago
- Coqui Inference Engine☆38Updated 3 years ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆117Updated 6 months ago
- BurrMill core☆21Updated 3 years ago
- ☆110Updated 7 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆23Updated 9 months ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆27Updated 7 months ago
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆41Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago