Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.
β33Apr 14, 2026Updated 2 weeks ago
Alternatives and similar repositories for babylon
Users that are interested in babylon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π± Flutter demo app for Arabic TTS ποΈ β ONNX-based offline speech synthesis πβ16May 3, 2025Updated 11 months ago
- ποΈ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format β Python package for offline speech synthesis ππ¦β38Feb 25, 2026Updated 2 months ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- β33Nov 27, 2021Updated 4 years ago
- mnn tts demo.β19May 7, 2025Updated 11 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β13Mar 9, 2022Updated 4 years ago
- β13Apr 14, 2024Updated 2 years ago
- β13Oct 27, 2021Updated 4 years ago
- ESLTTS datasetβ16Feb 6, 2025Updated last year
- Launch your speech synthesis within one minute.β12May 6, 2024Updated last year
- β33Aug 6, 2021Updated 4 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speechβ16Sep 20, 2024Updated last year
- Training code for kokoro tts modelβ38Nov 15, 2025Updated 5 months ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ52Apr 18, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Assistance component base for Dicio assistant componentsβ13Updated this week
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open β¦β23Apr 13, 2026Updated 2 weeks ago
- VITS Inference using ONNX Runtime on C++β13Dec 25, 2023Updated 2 years ago
- β44Jan 20, 2025Updated last year
- Openfst mirror with some fixesβ15Aug 23, 2024Updated last year
- β18Apr 28, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- wake word spotting with kaldiβ19Dec 3, 2020Updated 5 years ago
- β59Feb 8, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β22Jun 30, 2021Updated 4 years ago
- β40Aug 15, 2021Updated 4 years ago
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speechβ35May 1, 2024Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Mar 6, 2023Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β108Mar 15, 2026Updated last month
- Finally, some decent sample sentencesβ23Dec 3, 2023Updated 2 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for Gβ¦β20May 21, 2025Updated 11 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Nov 30, 2022Updated 3 years ago
- β14Aug 19, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- β40Apr 29, 2024Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denβ¦β111Aug 16, 2024Updated last year
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain onlyβ33Jul 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)β17Feb 29, 2024Updated 2 years ago
- β21Sep 24, 2018Updated 7 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β22Jun 7, 2025Updated 10 months ago