Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.
β34Apr 14, 2026Updated last month
Alternatives and similar repositories for babylon
Users that are interested in babylon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π± Flutter demo app for Arabic TTS ποΈ β ONNX-based offline speech synthesis πβ16May 3, 2025Updated last year
- ποΈ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format β Python package for offline speech synthesis ππ¦β38Feb 25, 2026Updated 2 months ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- β33Nov 27, 2021Updated 4 years ago
- mnn tts demo.β19May 7, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β13Mar 9, 2022Updated 4 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 5 years ago
- β13May 1, 2026Updated 2 weeks ago
- ESLTTS datasetβ16Feb 6, 2025Updated last year
- Launch your speech synthesis within one minute.β12May 6, 2024Updated 2 years ago
- β33Aug 6, 2021Updated 4 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speechβ16Sep 20, 2024Updated last year
- Training code for kokoro tts modelβ40Nov 15, 2025Updated 6 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open β¦β23May 13, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- VITS Inference using ONNX Runtime on C++β13Dec 25, 2023Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ53Apr 18, 2025Updated last year
- Openfst mirror with some fixesβ15Aug 23, 2024Updated last year
- β18Apr 28, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- wake word spotting with kaldiβ19Dec 3, 2020Updated 5 years ago
- β59Feb 8, 2026Updated 3 months ago
- β22Jun 30, 2021Updated 4 years ago
- β40Aug 15, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speechβ35May 1, 2024Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Mar 6, 2023Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β108Mar 15, 2026Updated 2 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Mar 21, 2021Updated 5 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for Gβ¦β20May 21, 2025Updated 11 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Nov 30, 2022Updated 3 years ago
- β14Aug 19, 2024Updated last year
- β40Apr 29, 2024Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denβ¦β115Aug 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain onlyβ33Jul 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)β17Feb 29, 2024Updated 2 years ago
- β21Sep 24, 2018Updated 7 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β22Jun 7, 2025Updated 11 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightningβ18Oct 20, 2024Updated last year
- Forced alignment decoder for Whisper.β16Mar 13, 2024Updated 2 years ago