ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆28Dec 4, 2024Updated last year
Alternatives and similar repositories for forcealign
Users that are interested in forcealign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Decoders from Kaldi using OpenFst☆36Apr 10, 2026Updated 2 months ago
- 🙊Cogified speech-to-text model nvidia/canary-qwen-2.5b (best ASR model according to hf-audio/open_asr_leaderboard as of 18/Jul/2025)🎙️☆22Jul 28, 2025Updated 11 months ago
- Package for easy handle mobi books in swift☆12Feb 5, 2026Updated 4 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- ✒️ LanguageTool integration for Quill.js editors☆17Aug 20, 2024Updated last year
- ☆30Aug 8, 2024Updated last year
- Cochlear implant signal processing☆10Jun 24, 2021Updated 5 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- MagicData-RAMC Dataset and Baseline☆64Sep 13, 2022Updated 3 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- 专业的网页前端国际化解决方案,JavaScript自动翻译库,用于将网页自动翻译为各国语言,支持智能缓存和可定制的配置。 Professional web front-end internationalization solution, JavaScript automat…☆18Apr 22, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆22Jul 5, 2023Updated 2 years ago
- 智谱 Realtime API 接口前端使用样例☆29Sep 3, 2025Updated 9 months ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- EXCEL单词表音标生成(附墨墨词库)☆16Sep 22, 2022Updated 3 years ago
- ☆12Feb 11, 2026Updated 4 months ago
- ☆132Jun 1, 2026Updated last month
- Provide the best of TED.com for offline usage!☆20Jun 15, 2026Updated 2 weeks ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆22Oct 18, 2023Updated 2 years ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆18Mar 21, 2025Updated last year
- Remove the handwriting of WPI Images with inpainting.☆25Feb 13, 2023Updated 3 years ago
- ☆32May 10, 2024Updated 2 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆20Jul 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- VoxCPM2 TTS for ComfyUI. 30 languages, voice design, controllable cloning, 48kHz audio, and LoRA training☆170Apr 12, 2026Updated 2 months ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- ☆16Aug 10, 2025Updated 10 months ago
- A sample Xcode Project to run Python in Xcode.☆13Apr 1, 2022Updated 4 years ago
- Japanese Spelling Correction - JSC☆14Sep 19, 2023Updated 2 years ago