Resources that make every language unique
☆28May 8, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 3 years ago
- Vosk Speech Recognition Plugin for Nativescript☆20Oct 30, 2021Updated 4 years ago
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter☆22Jan 3, 2025Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Neural network sequence labeling model☆11Dec 28, 2019Updated 6 years ago
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.☆12May 22, 2023Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 3 years ago
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Sep 13, 2022Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Thai smart home corpus with "Gowajee" hotword☆19Jul 30, 2023Updated 2 years ago
- ☆21Jul 22, 2022Updated 3 years ago
- Text To Speech Synthesis with Vosk☆262Mar 14, 2026Updated 2 months ago
- Openfst mirror with some fixes☆15Aug 23, 2024Updated last year
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Russian speech technology links☆396Mar 17, 2026Updated 2 months ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆14Sep 23, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Bilingual-TTS (Japanese and Korean)☆32Jul 1, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆25Jan 14, 2021Updated 5 years ago
- ASR on WS, POST/GET FAST_API Can use many RU asr models.☆19May 12, 2026Updated last week
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated 11 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- Рускоговорящий GLaDOS анти-ассистент☆14Jun 23, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Reimplementation of Miipher☆30Aug 16, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆23Dec 5, 2022Updated 3 years ago
- List of direct speech-to-speech translation papers.☆39Jan 31, 2023Updated 3 years ago
- 🧡 Hacker News summaries☆22Apr 10, 2024Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Apr 22, 2026Updated last month
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 6 months ago