Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.
β30Mar 9, 2026Updated last week
Alternatives and similar repositories for babylon
Users that are interested in babylon are comparing it to the libraries listed below
Sorting:
- π± Flutter demo app for Arabic TTS ποΈ β ONNX-based offline speech synthesis πβ14May 3, 2025Updated 10 months ago
- ποΈ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format β Python package for offline speech synthesis ππ¦β37Feb 25, 2026Updated 3 weeks ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- β33Nov 27, 2021Updated 4 years ago
- mnn tts demo.β19May 7, 2025Updated 10 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β13Mar 9, 2022Updated 4 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 5 years ago
- β13Apr 14, 2024Updated last year
- ESLTTS datasetβ16Feb 6, 2025Updated last year
- Pure C# port of the Pocketsphinx keyword spotterβ13Jan 19, 2020Updated 6 years ago
- Launch your speech synthesis within one minute.β12May 6, 2024Updated last year
- β33Aug 6, 2021Updated 4 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speechβ16Sep 20, 2024Updated last year
- Training code for kokoro tts modelβ36Nov 15, 2025Updated 4 months ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ51Apr 18, 2025Updated 11 months ago
- Assistance component base for Dicio assistant componentsβ13May 27, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open β¦β23Updated this week
- VITS Inference using ONNX Runtime on C++β13Dec 25, 2023Updated 2 years ago
- β40Jan 20, 2025Updated last year
- Openfst mirror with some fixesβ14Aug 23, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- β18Apr 28, 2021Updated 4 years ago
- wake word spotting with kaldiβ19Dec 3, 2020Updated 5 years ago
- β22Jun 30, 2021Updated 4 years ago
- β57Feb 8, 2026Updated last month
- β40Aug 15, 2021Updated 4 years ago
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speechβ34May 1, 2024Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Mar 6, 2023Updated 3 years ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fasβ¦β24Updated this week
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-β¦β16Feb 1, 2026Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β108Updated this week
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Mar 21, 2021Updated 4 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for Gβ¦β20May 21, 2025Updated 9 months ago
- Finally, some decent sample sentencesβ23Dec 3, 2023Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Nov 30, 2022Updated 3 years ago
- β14Aug 19, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denβ¦β109Aug 16, 2024Updated last year
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain onlyβ33Jul 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year