ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆28Dec 4, 2024Updated last year
Alternatives and similar repositories for forcealign
Users that are interested in forcealign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Decoders from Kaldi using OpenFst☆35Apr 10, 2026Updated last month
- 🙊Cogified speech-to-text model nvidia/canary-qwen-2.5b (best ASR model according to hf-audio/open_asr_leaderboard as of 18/Jul/2025)🎙️☆21Jul 28, 2025Updated 9 months ago
- Package for easy handle mobi books in swift☆12Feb 5, 2026Updated 3 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- ✒️ LanguageTool integration for Quill.js editors☆17Aug 20, 2024Updated last year
- ☆30Aug 8, 2024Updated last year
- 专业的网页前端国际化解决方案,JavaScript自动翻译库,用于将网页自动翻译为各国语言,支持智能缓存和可定制的配置。 Professional web front-end internationalization solution, JavaScript automat…☆17Apr 22, 2025Updated last year
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- MagicData-RAMC Dataset and Baseline☆60Sep 13, 2022Updated 3 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- This is a Document to Handwriting a website using HTML, CSS, JS and Google font API. We type our work in the text box and our work will b…☆21Feb 14, 2023Updated 3 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆22Jul 5, 2023Updated 2 years ago
- 智谱 Realtime API 接口前端使用样例☆28Sep 3, 2025Updated 8 months ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- EXCEL单词表音标生成(附墨墨词库)☆16Sep 22, 2022Updated 3 years ago
- ☆12Feb 11, 2026Updated 3 months ago
- ☆132May 4, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Provide the best of TED.com for offline usage!☆20May 5, 2026Updated 2 weeks ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆18Mar 21, 2025Updated last year
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆21Oct 18, 2023Updated 2 years ago
- Etymology Viewer that gives you the origin of a word☆11May 3, 2026Updated 2 weeks ago
- VoxCPM2 TTS for ComfyUI. 30 languages, voice design, controllable cloning, 48kHz audio, and LoRA training☆115Apr 12, 2026Updated last month
- Remove the handwriting of WPI Images with inpainting.☆24Feb 13, 2023Updated 3 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆20Jul 5, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- ☆12Apr 25, 2022Updated 4 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year