Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model, and speech synthesis uses VITS models. Piper models are compatible once they have been run through a conversion script.
★37 · Apr 14, 2026 · Updated 2 months ago
Alternatives and similar repositories for babylon
Users who are interested in babylon are comparing it to the libraries listed below.
- Flutter demo app for Arabic TTS: ONNX-based offline speech synthesis ★17 · May 3, 2025 · Updated last year
- Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format: Python package for offline speech synthesis ★43 · Jun 20, 2026 · Updated last week
- Using OpenVINO to speed up MeloTTS inference ★15 · Nov 1, 2024 · Updated last year
- ★33 · Nov 27, 2021 · Updated 4 years ago
- mnn tts demo. ★19 · May 7, 2025 · Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ★13 · Mar 9, 2022 · Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech ★10 · Jun 4, 2020 · Updated 6 years ago
- ★13 · May 1, 2026 · Updated last month
- ★13 · Oct 27, 2021 · Updated 4 years ago
- ESLTTS dataset ★16 · Feb 6, 2025 · Updated last year
- Launch your speech synthesis within one minute. ★12 · May 6, 2024 · Updated 2 years ago
- ★33 · Aug 6, 2021 · Updated 4 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech ★16 · Sep 20, 2024 · Updated last year
- Training code for kokoro tts model ★45 · Nov 15, 2025 · Updated 7 months ago
- ★22 · Jun 30, 2021 · Updated 4 years ago
- Assistance component base for Dicio assistant components ★13 · Apr 23, 2026 · Updated 2 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open … ★23 · May 19, 2026 · Updated last month
- VITS Inference using ONNX Runtime on C++ ★13 · Dec 25, 2023 · Updated 2 years ago
- ★48 · Jan 20, 2025 · Updated last year
- zero-shot realtime TTS system, fully offline, free and open source ★55 · Apr 18, 2025 · Updated last year
- Openfst mirror with some fixes ★16 · Aug 23, 2024 · Updated last year
- ★18 · Apr 28, 2021 · Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ★16 · Dec 3, 2024 · Updated last year
- wake word spotting with kaldi ★19 · Dec 3, 2020 · Updated 5 years ago
- ★58 · Feb 8, 2026 · Updated 4 months ago
- ★40 · Aug 15, 2021 · Updated 4 years ago
- A framework for creating voice based agents. Integrates LLMs with speech recognition and text-to-speech ★35 · May 1, 2024 · Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ★17 · Mar 6, 2023 · Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★110 · Mar 15, 2026 · Updated 3 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ★23 · Mar 21, 2021 · Updated 5 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G… ★19 · May 21, 2025 · Updated last year
- Finally, some decent sample sentences ★24 · Dec 3, 2023 · Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ★18 · Nov 30, 2022 · Updated 3 years ago
- ★14 · Aug 19, 2024 · Updated last year
- ★40 · Apr 29, 2024 · Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ★116 · Aug 16, 2024 · Updated last year
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only ★33 · May 28, 2026 · Updated last month
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ★18 · Aug 16, 2024 · Updated last year
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine) ★19 · Feb 29, 2024 · Updated 2 years ago