lukerbs / forcealignLinks
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
☆20Updated 11 months ago
Alternatives and similar repositories for forcealign
Users that are interested in forcealign are comparing it to the libraries listed below
Sorting:
- An unofficial PyTorch implementation of VALL-E☆88Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- ☆23Updated last year
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- Official Code for ParrotTTS☆57Updated last year
- ☆57Updated last year
- ☆100Updated last month
- a lightweight voice conversion☆85Updated last year
- Putting flows on top of neural transducers for better TTS☆64Updated 3 weeks ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆88Updated 3 weeks ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆32Updated 5 months ago
- Just another FastSpeech 2 but cleaner code :)☆27Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆107Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆76Updated last year
- ☆71Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated 2 years ago
- ☆56Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆31Updated 11 months ago
- ☆46Updated this week
- 4G GPU & 10 Minutes for train☆12Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- ☆47Updated 2 weeks ago
- Finetuning VITS Efficiently☆33Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆65Updated last year
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆123Updated 5 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆83Updated 3 weeks ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆105Updated 10 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆101Updated 5 months ago