Mobile-Artificial-Intelligence/babylon

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mobile-Artificial-Intelligence/babylon)

Mobile-Artificial-Intelligence / babylon

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

☆39

Alternatives and similar repositories for babylon

Users that are interested in babylon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nipponjo / tts-arabic-flutter
View on GitHub
📱 Flutter demo app for Arabic TTS 🎙️ — ONNX-based offline speech synthesis 🚀
☆17May 3, 2025Updated last year
nipponjo / tts_arabic
View on GitHub
🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python package for offline speech synthesis 🚀📦
☆44Jun 20, 2026Updated last month
zhaohb / MeloTTS-OV
View on GitHub
Using OpenVINO to speed up MeloTTS inference
☆15Nov 1, 2024Updated last year
wangzhaode / mnn-tts
View on GitHub
mnn tts demo.
☆19May 7, 2025Updated last year
NeuralVox / OpenPhonemizer
View on GitHub
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆111Mar 15, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
k2-fsa / colab
View on GitHub
Colab notebooks for Next-gen Kaldi
☆31Oct 12, 2025Updated 9 months ago
avryhof / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Mar 9, 2022Updated 4 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
42io / tflite_kws
View on GitHub
☆13May 1, 2026Updated 2 months ago
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
Deep-unlearning / Llasa-GRPO
View on GitHub
☆18Nov 19, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
talker93 / oneMinTTS
View on GitHub
Launch your speech synthesis within one minute.
☆12May 6, 2024Updated 2 years ago
Mashiro009 / wenet-onnx
View on GitHub
☆33Aug 6, 2021Updated 4 years ago
mush42 / libtashkeel
View on GitHub
Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
☆50Oct 4, 2025Updated 9 months ago
IntendedConsequence / vadc
View on GitHub
Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech
☆16Sep 20, 2024Updated last year
alumae / torch-xvectors-wav
View on GitHub
☆22Jun 30, 2021Updated 5 years ago
sitelift / Chirp
View on GitHub
Free, local voice-to-text for Windows & macOS. No cloud, no account, no subscription.
☆15Jul 11, 2026Updated last week
DicioTeam / dicio-skill
View on GitHub
Assistance component base for Dicio assistant components
☆13Apr 23, 2026Updated 2 months ago
seanghay / vits.cpp
View on GitHub
VITS Inference using ONNX Runtime on C++
☆13Dec 25, 2023Updated 2 years ago
Open-Speech-EkStep / crowdsource-dataplatform
View on GitHub
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17Mar 6, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gooofy / zerovox
View on GitHub
zero-shot realtime TTS system, fully offline, free and open source
☆55Apr 18, 2025Updated last year
locaith / bio-memory-ai-locaith
View on GitHub
🧠 Bio-Agent OS: 🇻🇳 Bio-Inspired Memory Framework for AI Agents (OpenClaw/ERP). Researched & Developed by Dev Tuan Anh Ha (Locaith Solu…
☆20Apr 21, 2026Updated 3 months ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
Open-Speech-EkStep / data-acquisition-pipeline
View on GitHub
☆18Apr 28, 2021Updated 5 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
DaveDeCaprio / voice-stream
View on GitHub
A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech
☆35May 1, 2024Updated 2 years ago
Mashiro009 / wenet-online-decoder-onnx
View on GitHub
☆40Aug 15, 2021Updated 4 years ago
Vyvo-Labs / CodecHub
View on GitHub
CodecHub: A Unified Library for Codec Models
☆25Dec 24, 2025Updated 6 months ago
Otosaku / OtosakuKWS-iOS
View on GitHub
Lightweight on-device keyword spotting engine for iOS using CoreML and real-time audio streaming.
☆16Jun 14, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Prem-kumar27 / Fast-KTSpeechCrawler
View on GitHub
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23Mar 21, 2021Updated 5 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
MahtaFetrat / LLM-Powered-G2P
View on GitHub
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…
☆19May 21, 2025Updated last year
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago