rhasspy / glow-speak
Neural text to speech system that uses eSpeak as a text/phoneme front-end
☆16 Updated 4 years ago
Alternatives and similar repositories for glow-speak
Users who are interested in glow-speak are comparing it to the libraries listed below.
- Robust Speech Recognition via Large-Scale Weak Supervision ☆29 Updated 2 years ago
- Launch your speech synthesis within one minute. ☆12 Updated last year
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G… ☆17 Updated 7 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 Updated 7 months ago
- An even smaller speech recognizer / force aligner ☆37 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 Updated last month
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. ☆11 Updated 5 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 Updated 9 months ago
- My guide to creating an Italian TTS with Coqui ☆14 Updated 3 years ago
- Open tools and data for cloudless automatic speech recognition ☆11 Updated 6 years ago
- Persian Grapheme-to-Phoneme (G2P) converter ☆21 Updated 5 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆25 Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆46 Updated 2 years ago
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js ☆21 Updated 2 years ago
- Zero-shot realtime TTS system, fully offline, free and open source ☆50 Updated 8 months ago
- Project of Singing Voice Conversion. ☆15 Updated 2 years ago
- Docker for building an environment for Dutch online and offline ASR. ☆12 Updated 4 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr… ☆28 Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 Updated 2 years ago
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 Updated 2 years ago
- Russian accentuator and IPA transcriber ☆16 Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆41 Updated 3 weeks ago
- ☆52 Updated last week
- DeepSpeech ASR Model for the Catalan Language ☆17 Updated 4 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 Updated 2 years ago
- ☆38 Updated last year
- Streaming Audio Models Examples in JS ☆19 Updated last year