neurlang / gospeakLinks
[unfinished] A Golang Text to Speech System
☆14Updated 3 months ago
Alternatives and similar repositories for gospeak
Users that are interested in gospeak are comparing it to the libraries listed below
Sorting:
- IPA Phonemizer/Dephonemizer for 139 human languages☆35Updated this week
- Faster Whisper ASR transcription with CTranslate2☆23Updated 10 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆23Updated this week
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆15Updated 3 years ago
- Golang bindings for Coqui's speech-to-text library☆34Updated 3 years ago
- ☆34Updated this week
- ☆16Updated 4 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆10Updated 3 months ago
- Real-time end-to-end singing voice convertion☆22Updated 10 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 7 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆26Updated last year
- StyleTTS 2 Optimized Training Fork☆33Updated 7 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 5 months ago
- On-device streaming text-to-speech engine powered by deep learning☆120Updated 3 weeks ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 10 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- state-of-the-art models for diacritics restoration for Arabic language☆13Updated 6 months ago
- Whisper finetuning☆14Updated 4 months ago
- C++ library for converting text to phonemes for Piper☆132Updated last month
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆73Updated 4 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆39Updated last week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 4 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆121Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 3 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆11Updated 7 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆15Updated 11 months ago
- Python implementation of a few speech intelligibility prediction algorithms☆14Updated last year
- High quality text-to-speech based on StyleTTS 2.☆60Updated this week
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago