Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model, and speech synthesis uses VITS models. Piper models are compatible once a conversion script has been run.
☆31 · Mar 9, 2026 · Updated last month
Alternatives and similar repositories for babylon
Users that are interested in babylon are comparing it to the libraries listed below.
- Flutter demo app for Arabic TTS: ONNX-based offline speech synthesis ☆15 · May 3, 2025 · Updated 11 months ago
- Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format: Python package for offline speech synthesis ☆37 · Feb 25, 2026 · Updated last month
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- (no description) ☆33 · Nov 27, 2021 · Updated 4 years ago
- MNN TTS demo. ☆19 · May 7, 2025 · Updated 11 months ago
- A light-weight HTTP proxy server written in Objective-C 【UNFINISHED】 ☆10 · Jul 28, 2016 · Updated 9 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆13 · Mar 9, 2022 · Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- (no description) ☆13 · Apr 14, 2024 · Updated last year
- (no description) ☆13 · Oct 27, 2021 · Updated 4 years ago
- ESLTTS dataset ☆16 · Feb 6, 2025 · Updated last year
- A version of LwIP enhanced to support Nest OpenWeave running on the ESP32 ☆15 · Apr 30, 2020 · Updated 5 years ago
- eSpeak-NG wrapper to Swift Package Manager ☆13 · Oct 11, 2025 · Updated 5 months ago
- Launch your speech synthesis within one minute. ☆12 · May 6, 2024 · Updated last year
- (no description) ☆33 · Aug 6, 2021 · Updated 4 years ago
- Uses the excellent Silero VAD with the ONNX Runtime C API for fast detection of audio segments with speech ☆16 · Sep 20, 2024 · Updated last year
- Training code for kokoro tts model ☆37 · Nov 15, 2025 · Updated 4 months ago
- Zero-shot realtime TTS system, fully offline, free and open source ☆51 · Apr 18, 2025 · Updated 11 months ago
- Assistance component base for Dicio assistant components ☆13 · May 27, 2024 · Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open … ☆23 · Mar 17, 2026 · Updated 3 weeks ago
- VITS Inference using ONNX Runtime on C++ ☆13 · Dec 25, 2023 · Updated 2 years ago
- Openfst mirror with some fixes ☆15 · Aug 23, 2024 · Updated last year
- (no description) ☆18 · Apr 28, 2021 · Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- Wake word spotting with Kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- (no description) ☆22 · Jun 30, 2021 · Updated 4 years ago
- (no description) ☆58 · Feb 8, 2026 · Updated 2 months ago
- Small Cocoa CSV file parser (see link for the official repository on github). ☆33 · Jun 4, 2019 · Updated 6 years ago
- (no description) ☆40 · Aug 15, 2021 · Updated 4 years ago
- An I2C bootloader for ATTiny devices based on AVR112. ☆18 · Apr 30, 2024 · Updated last year
- A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech ☆34 · May 1, 2024 · Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Mar 6, 2023 · Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆108 · Mar 15, 2026 · Updated 3 weeks ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 · Mar 21, 2021 · Updated 5 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G… ☆20 · May 21, 2025 · Updated 10 months ago
- Finally, some decent sample sentences ☆23 · Dec 3, 2023 · Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Nov 30, 2022 · Updated 3 years ago
- (no description) ☆14 · Aug 19, 2024 · Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆110 · Aug 16, 2024 · Updated last year