lukerbs/forcealign

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lukerbs/forcealign)

lukerbs / forcealign

ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.

☆27

Alternatives and similar repositories for forcealign

Users that are interested in forcealign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

basswood-io / BasswoodAV
View on GitHub
Python bindings for ffmpeg libraries
☆15Jun 24, 2025Updated last year
Mddct / cosyvoice2-flow-optimized
View on GitHub
faster inference
☆27Jan 20, 2025Updated last year
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
awxkee / libmobi-swift
View on GitHub
Package for easy handle mobi books in swift
☆12Feb 5, 2026Updated 5 months ago
danpovey / conditional-flow-matching
View on GitHub
☆29Aug 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
skysbird / g2p-zh-en
View on GitHub
Chinese and English Bilinguish G2P
☆22Jul 16, 2023Updated 3 years ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
MagicHub-io / MagicData-RAMC
View on GitHub
MagicData-RAMC Dataset and Baseline
☆64Sep 13, 2022Updated 3 years ago
Rudrabha / 8X-Super-Resolution
View on GitHub
This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…
☆16Aug 26, 2020Updated 5 years ago
zhengmidon / singaligner
View on GitHub
a compact audio-to-phoneme aligner for singing voice
☆12Jan 17, 2024Updated 2 years ago
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago
MingLunHan / CIF-HieraDist
View on GitHub
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
☆41Jul 14, 2026Updated last week
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Jittor / deeplab-jittor
View on GitHub
☆10May 24, 2020Updated 6 years ago
soskuthy / gamm_strategies
View on GitHub
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10Jan 25, 2021Updated 5 years ago
limcheekin / talk-to-ai
View on GitHub
Talk To AI with FastRTC enables natural, real-time voice conversations with AI using WebRTC, offering customizable voices, interfaces, an…
☆47Mar 10, 2025Updated last year
roedoejet / FastSpeech2
View on GitHub
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Jul 5, 2023Updated 3 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆12Feb 11, 2026Updated 5 months ago
osdf / datasets
View on GitHub
Some stuff to handle various datasets
☆15Mar 2, 2018Updated 8 years ago
Priyadarshan2000 / Doc-to-Handwritting
View on GitHub
This is a Document to Handwriting a website using HTML, CSS, JS and Google font API. We type our work in the text box and our work will b…
☆21Feb 14, 2023Updated 3 years ago
rrkarim / unbounded-cache-lm
View on GitHub
Unbounded cache model for online language modeling with open vocabulary
☆11Feb 15, 2019Updated 7 years ago
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AliceNavigator / SpeakerClassifier
View on GitHub
A lightweight tool that efficiently isolates target speaker data from your datasets.
☆20Nov 23, 2024Updated last year
tuostudy / EXCEL-Word-List-Phonetic-Generation
View on GitHub
EXCEL单词表音标生成（附墨墨词库）
☆16Sep 22, 2022Updated 3 years ago
nitotm / efficient-language-detector-py
View on GitHub
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
☆22Updated this week
VITA-Group / SSM-Bottleneck
View on GitHub
[ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…
☆18Mar 21, 2025Updated last year
Nikilicious09 / NVPython
View on GitHub
A sample Xcode Project to run Python in Xcode.
☆13Apr 1, 2022Updated 4 years ago
hyyoka / Acoustic-Features
View on GitHub
audio/speech feature extraction using parselmouth, librosa, disvoice
☆10Jan 28, 2022Updated 4 years ago
wq2012 / CurriculumVitae
View on GitHub
Curriculum Vitae of Quan Wang
☆15Dec 13, 2025Updated 7 months ago
LWprogramming / audiolm-pytorch-training
View on GitHub
audiolm-pytorch training code
☆15Jul 31, 2023Updated 2 years ago
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
openzim / ted
View on GitHub
Provide the best of TED.com for offline usage!
☆20Jun 15, 2026Updated last month
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
AMLAB-Wakayama / gammachirp-filterbank
View on GitHub
An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆14Jul 7, 2026Updated 2 weeks ago
mida-project / eye-tracker-naive
View on GitHub
Tobii Eye Tracker 4C Naïve Solution
☆20Feb 25, 2021Updated 5 years ago
kunato / transnetv2pt
View on GitHub
☆11Sep 30, 2021Updated 4 years ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
megseekosh / dsp_tutorials
View on GitHub
I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …
☆12Feb 5, 2024Updated 2 years ago