r4victor / afaligner
A forced aligner intended for synchronization of narrated text
☆87 · Updated last year
Alternatives and similar repositories for afaligner:
Users who are interested in afaligner are comparing it to the libraries listed below.
- Timething is a library for aligning text transcripts with their audio recordings. ☆111 · Updated last month
- A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays) ☆282 · Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆107 · Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription ☆138 · Updated 7 months ago
- A python package for deep multilingual punctuation prediction. ☆111 · Updated 4 months ago
- DeepSpeech based forced alignment tool ☆235 · Updated 4 years ago
- An even smaller speech recognizer / force aligner ☆32 · Updated last month
- Gecko - A Tool for Effective Annotation of Human Conversations ☆279 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆190 · Updated this week
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆144 · Updated this week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 · Updated last year
- Python interface for forced audio alignment using HTK and SoX ☆334 · Updated 4 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆84 · Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆77 · Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆140 · Updated 11 months ago
- python3.6+ port of aeneas ☆14 · Updated 3 years ago
- Python forced alignment ☆77 · Updated 9 months ago
- A collection of files for LibriVox recordings to produce ebooks with synchronized text and audio ☆25 · Updated 4 years ago
- The CMU Pronouncing Dictionary converted to IPA ☆78 · Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆151 · Updated last year
- Audiobook alignment for Indigenous languages ☆38 · Updated 3 weeks ago
- A tool for automatic phoneme transcription ☆157 · Updated last year
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model. ☆35 · Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆144 · Updated 8 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress), a python package for placing stress in Russian text using RNN (BiLST… ☆30 · Updated 5 months ago
- Grapheme to phoneme conversion with deep learning. ☆367 · Updated last year
- ☆25 · Updated 9 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆24 · Updated this week
- ☆30 · Updated 6 months ago