Resources that make every language unique
☆27 · Feb 21, 2026 · Updated last month
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below.
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC… ☆20 · May 12, 2023 · Updated 2 years ago
- Vosk Speech Recognition Plugin for Nativescript ☆20 · Oct 30, 2021 · Updated 4 years ago
- ARCHIVED INDEFINITELY (for api design reasons). Compose Multiplatform router-style navigation library, based on Decompose, with custom an… ☆12 · Nov 4, 2024 · Updated last year
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter ☆22 · Jan 3, 2025 · Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆11 · May 14, 2025 · Updated 10 months ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- Neural network sequence labeling model ☆11 · Dec 28, 2019 · Updated 6 years ago
- phone inventory library ☆17 · May 15, 2023 · Updated 2 years ago
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT. ☆12 · May 22, 2023 · Updated 2 years ago
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa). ☆15 · Jun 30, 2023 · Updated 2 years ago
- ☆13 · Dec 7, 2022 · Updated 3 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle. ☆11 · Feb 17, 2024 · Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool ☆22 · Sep 13, 2022 · Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 · Oct 19, 2022 · Updated 3 years ago
- Thai smart home corpus with "Gowajee" hotword ☆18 · Jul 30, 2023 · Updated 2 years ago
- ☆20 · Jul 22, 2022 · Updated 3 years ago
- Decompose BottomSheet Sample ☆14 · Feb 16, 2026 · Updated last month
- Text To Speech Synthesis with Vosk ☆252 · Mar 14, 2026 · Updated last week
- Openfst mirror with some fixes ☆15 · Aug 23, 2024 · Updated last year
- Yet Another Kotlin COmpose Validation library ☆20 · Nov 4, 2025 · Updated 4 months ago
- Word Error Rate Estimation ☆16 · Aug 25, 2020 · Updated 5 years ago
- Russian speech technology links ☆370 · Mar 17, 2026 · Updated last week
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature… ☆14 · Sep 23, 2024 · Updated last year
- ☆14 · Aug 19, 2024 · Updated last year
- Bilingual-TTS (Japanese and Korean) ☆32 · Jul 1, 2023 · Updated 2 years ago
- ☆24 · Jan 14, 2021 · Updated 5 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆18 · Aug 16, 2024 · Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆22 · Dec 5, 2022 · Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Jun 7, 2025 · Updated 9 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- Russian-speaking GLaDOS anti-assistant ☆14 · Jun 23, 2024 · Updated last year
- Forced alignment decoder for Whisper. ☆15 · Mar 13, 2024 · Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated last year
- Reimplementation of Miipher ☆29 · Aug 16, 2023 · Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆18 · Apr 17, 2024 · Updated last year
- List of direct speech-to-speech translation papers. ☆38 · Jan 31, 2023 · Updated 3 years ago