Uses ctypes and libespeak-ng to transform text into IPA phonemes
☆26 · Sep 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for espeak-phonemizer
Users that are interested in espeak-phonemizer are comparing it to the libraries listed below.
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation ☆18 · Nov 28, 2023 · Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- Convert native orthographies to the International Phonetic Alphabet ☆18 · Jul 4, 2025 · Updated 9 months ago
- vosk wake word plugin for OpenVoiceOS ☆12 · Apr 8, 2026 · Updated last week
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 · Dec 21, 2023 · Updated 2 years ago
- Evaluation of STT models for german language ☆15 · Jan 22, 2022 · Updated 4 years ago
- ☆12 · Nov 12, 2024 · Updated last year
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing ☆89 · Sep 6, 2024 · Updated last year
- Awesome stuff made by the Mycroft community ☆13 · Sep 16, 2021 · Updated 4 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset ☆15 · Apr 7, 2025 · Updated last year
- ☆10 · Jul 27, 2021 · Updated 4 years ago
- ☆14 · Sep 21, 2022 · Updated 3 years ago
- A streaming Speech to Text server using DeepSpeech ☆16 · May 10, 2020 · Updated 5 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine ☆14 · Oct 16, 2017 · Updated 8 years ago
- ☆14 · Jan 2, 2025 · Updated last year
- A Mycroft.AI skill for a collection of Mycroft units to communicate with each other. ☆16 · May 4, 2021 · Updated 4 years ago
- Coqui Inference Engine ☆41 · Aug 3, 2021 · Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi] ☆12 · Jun 5, 2020 · Updated 5 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆331 · Nov 15, 2024 · Updated last year
- Dataset, code and results repository for SBA-Net. ☆14 · Sep 23, 2022 · Updated 3 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆100 · Nov 20, 2023 · Updated 2 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 · May 20, 2023 · Updated 2 years ago
- Grapheme to phoneme model for PyTorch ☆43 · Jul 21, 2022 · Updated 3 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit. ☆10 · Feb 29, 2024 · Updated 2 years ago
- ☆17 · Aug 27, 2025 · Updated 7 months ago
- Some basic tools for interacting with `tcf-agent` ☆11 · Jan 19, 2024 · Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- ☆16 · Nov 19, 2021 · Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆43 · Aug 3, 2022 · Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- A collection of small corpuses of interesting data for the creation of bots and similar stuff. ☆10 · Sep 26, 2018 · Updated 7 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 · Dec 2, 2022 · Updated 3 years ago
- Official code for Wav2Seq ☆97 · Jul 19, 2022 · Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 4 years ago
- The python class was used on Dirble.com to get titles and bitrates on shoutcast and icecast streams. Will not be maintained anymore but d… ☆19 · May 15, 2021 · Updated 4 years ago