readbeyond/aeneas

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/readbeyond/aeneas)

readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

☆2,852

Alternatives and similar repositories for aeneas

Users that are interested in aeneas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pettarin / forced-alignment-tools
View on GitHub
A collection of links and notes on forced alignment tools
☆941Apr 18, 2026Updated 3 months ago
strob / gentle
View on GitHub
gentle forced aligner
☆1,703May 19, 2026Updated 2 months ago
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,847Jul 11, 2026Updated last week
mozilla / DSAlign
View on GitHub
DeepSpeech based forced alignment tool
☆239Dec 12, 2020Updated 5 years ago
r4victor / syncabook
View on GitHub
📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
☆352Jul 10, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
r4victor / afaligner
View on GitHub
📈 A forced aligner intended for synchronization of narrated text
☆104Aug 9, 2025Updated 11 months ago
prosodylab / Prosodylab-Aligner
View on GitHub
Python interface for forced audio alignment using HTK and SoX
☆351Jun 28, 2020Updated 6 years ago
ozdefir / finetuneas
View on GitHub
An HTML interface for finetuning the sync map output from aeneas
☆53Jul 5, 2022Updated 4 years ago
bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,557Sep 26, 2024Updated last year
readbeyond / menestrello
View on GitHub
Menestrello is the perfect app for reading+listening Audio-eBooks.
☆28May 25, 2015Updated 11 years ago
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
smacke / ffsubsync
View on GitHub
Automagically synchronize subtitles with video.
☆7,792Updated this week
buriburisuri / speech-to-text-wavenet
View on GitHub
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
☆4,005Oct 8, 2021Updated 4 years ago
feldberlin / timething
View on GitHub
Timething is a library for aligning text transcripts with their audio recordings.
☆131Dec 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mozilla / DeepSpeech
View on GitHub
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,772Jun 19, 2025Updated last year
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,432Sep 22, 2025Updated 9 months ago
zzw922cn / Automatic_Speech_Recognition
View on GitHub
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
☆2,834Mar 24, 2023Updated 3 years ago
r4victor / synclibrivox
View on GitHub
📦 A collection of files for LibriVox recordings to produce ebooks with synchronized text and audio
☆28Jun 5, 2020Updated 6 years ago
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆927Jan 5, 2023Updated 3 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,897Updated this week
sotelo / parrot
View on GitHub
RNN-based generative models for speech.
☆607Jun 23, 2017Updated 9 years ago
saurabhshri / CCAligner
View on GitHub
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
☆173Oct 27, 2019Updated 6 years ago
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
maxrmorrison / pyfoal
View on GitHub
Python forced alignment
☆95Apr 12, 2024Updated 2 years ago
mozilla / TTS
View on GitHub
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10,162Nov 9, 2023Updated 2 years ago
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,143Jul 13, 2026Updated last week
wiseman / py-webrtcvad
View on GitHub
Python interface to the WebRTC Voice Activity Detector
☆2,491Jul 4, 2024Updated 2 years ago
tyiannak / pyAudioAnalysis
View on GitHub
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,252Aug 4, 2025Updated 11 months ago
lingjzhu / charsiu
View on GitHub
Charsiu: A neural phonetic aligner.
☆346Sep 19, 2022Updated 3 years ago
Kyubyong / tacotron
View on GitHub
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
☆1,833Jan 17, 2022Updated 4 years ago
Rayhane-mamah / Tacotron-2
View on GitHub
DeepMind's Tacotron-2 Tensorflow implementation
☆2,323Jul 6, 2023Updated 3 years ago
CSTR-Edinburgh / merlin
View on GitHub
This is now the official location of the Merlin project.
☆1,320Mar 3, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ibab / tensorflow-wavenet
View on GitHub
A TensorFlow implementation of DeepMind's WaveNet paper
☆5,428Jul 12, 2023Updated 3 years ago
fatchord / WaveRNN
View on GitHub
WaveRNN Vocoder + TTS
☆2,187Jul 2, 2022Updated 4 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago
r9y9 / deepvoice3_pytorch
View on GitHub
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
☆1,978Dec 19, 2023Updated 2 years ago
jiaaro / pydub
View on GitHub
Manipulate audio with a simple and easy high level interface
☆9,779Mar 19, 2026Updated 4 months ago
Uberi / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆8,975Jun 16, 2026Updated last month
NVIDIA / waveglow
View on GitHub
A Flow-based Generative Network for Speech Synthesis
☆2,340Oct 19, 2023Updated 2 years ago