lukerbs / forcealign
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆12Updated 3 months ago
Alternatives and similar repositories for forcealign:
Users that are interested in forcealign are comparing it to the libraries listed below
- ☆20Updated 4 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- text to speech☆10Updated 11 months ago
- Aligner for text-to-speech☆14Updated 7 months ago
- ☆56Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆26Updated 7 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 7 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 11 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 3 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- ☆33Updated 3 years ago
- GPT for FACodec☆13Updated 11 months ago
- ☆13Updated 6 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Updated last year
- ☆13Updated last year
- English conversation corpus for conversational TTS.☆20Updated 2 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆32Updated 8 months ago
- Official Code for ParrotTTS☆48Updated 5 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 8 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- (WIP)long form speech generatoins☆30Updated 3 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆18Updated last month
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 3 years ago
- Decoders from Kaldi using OpenFst☆27Updated 2 months ago
- ☆41Updated last year