Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.
β35Apr 14, 2026Updated last month
Alternatives and similar repositories for babylon
Users that are interested in babylon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π± Flutter demo app for Arabic TTS ποΈ β ONNX-based offline speech synthesis πβ17May 3, 2025Updated last year
- ποΈ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format β Python package for offline speech synthesis ππ¦β41Feb 25, 2026Updated 3 months ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- β33Nov 27, 2021Updated 4 years ago
- mnn tts demo.β19May 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β13Mar 9, 2022Updated 4 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 6 years ago
- β13May 1, 2026Updated last month
- β13Oct 27, 2021Updated 4 years ago
- ESLTTS datasetβ16Feb 6, 2025Updated last year
- Launch your speech synthesis within one minute.β12May 6, 2024Updated 2 years ago
- β33Aug 6, 2021Updated 4 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speechβ16Sep 20, 2024Updated last year
- Training code for kokoro tts modelβ42Nov 15, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- β22Jun 30, 2021Updated 4 years ago
- Assistance component base for Dicio assistant componentsβ13Apr 23, 2026Updated last month
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open β¦β23May 19, 2026Updated 3 weeks ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ54Apr 18, 2025Updated last year
- Openfst mirror with some fixesβ15Aug 23, 2024Updated last year
- β18Apr 28, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts