lukerbs / forcealign
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word-level or phoneme-level alignments of text to audio, with the start and end time of each word or phoneme within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆13 · Updated 3 months ago
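To make the idea of word-level or phoneme-level alignment concrete, here is a minimal sketch of what such output might look like as data: a label plus a start and end time in seconds. The `AlignedUnit` class, the `unit_at` helper, and the timings are all hypothetical and chosen for illustration; they are not ForceAlign's actual API or real alignment results.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AlignedUnit:
    """Hypothetical container for one aligned word or phoneme (not ForceAlign's own class)."""
    label: str
    start: float  # seconds from the beginning of the audio
    end: float    # seconds from the beginning of the audio

    @property
    def duration(self) -> float:
        """Length of the unit in seconds."""
        return self.end - self.start


def unit_at(units: List[AlignedUnit], t: float) -> Optional[AlignedUnit]:
    """Return the unit whose [start, end) interval contains time t, or None."""
    for unit in units:
        if unit.start <= t < unit.end:
            return unit
    return None


# Made-up timings, for illustration only.
words = [
    AlignedUnit("the", 0.00, 0.12),
    AlignedUnit("quick", 0.12, 0.41),
    AlignedUnit("fox", 0.41, 0.80),
]

for w in words:
    print(f"{w.label}: {w.start:.2f}s to {w.end:.2f}s")
```

With alignments in this shape, a lookup such as `unit_at(words, 0.2)` tells you which word is being spoken at a given moment, which is the typical use for subtitles or karaoke-style highlighting.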
Alternatives and similar repositories for forcealign:
Users who are interested in forcealign are comparing it to the libraries listed below:
- ☆56 · Updated 2 years ago
- ☆20 · Updated 5 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. ☆39 · Updated last year
- Official Code for ParrotTTS ☆48 · Updated 5 months ago
- ☆56 · Updated 9 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆16 · Updated 11 months ago
- A Tiny Project For ASR model training and Deployment ☆27 · Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :) ☆26 · Updated 9 months ago
- Decoders from Kaldi using OpenFst ☆27 · Updated 2 months ago
- ☆13 · Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 · Updated last year
- Aligner for text-to-speech ☆14 · Updated 8 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 · Updated 2 years ago
- 4G GPU & 10 Minutes for train ☆12 · Updated last year
- text to speech ☆10 · Updated last year
- ☆28 · Updated last year
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University. ☆25 · Updated this week
- Incorporating AutoVocoder to MB-iSTFT-VITS ☆48 · Updated 2 years ago
- Open Source Speech/Text Data on AI ☆18 · Updated 2 years ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English. ☆11 · Updated 3 months ago
- A collection of all our phonemeizers for dataset construction and inference ☆22 · Updated last month
- 'Grad-TTS' with Multilingual Cleaners ☆10 · Updated 11 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 · Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*) ☆69 · Updated last year
- Temporary anonymous version ☆22 · Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Updated 6 months ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification ☆30 · Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆27 · Updated 7 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆26 · Updated 8 months ago
- GPT for FACodec ☆13 · Updated last year