lukerbs / forcealignLinks
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆17Updated 8 months ago
Alternatives and similar repositories for forcealign
Users that are interested in forcealign are comparing it to the libraries listed below
Sorting:
- Official Code for ParrotTTS☆53Updated 9 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆81Updated this week
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆99Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- Finetuning VITS Efficiently☆33Updated last year
- ☆65Updated last month
- a lightweight voice conversion☆84Updated 11 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated last month
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated last year
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆75Updated 9 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆28Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆101Updated 10 months ago
- 4G GPU & 10 Minutes for train☆12Updated 2 years ago
- ☆19Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆18Updated 8 months ago
- ☆22Updated 9 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated last year
- All generative model in one for better TTS model☆72Updated 11 months ago
- ☆58Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆94Updated 8 months ago
- ☆41Updated 10 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- ☆71Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆73Updated 9 months ago
- ☆69Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆97Updated 2 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- ☆56Updated 2 years ago
- ☆29Updated 6 months ago