r4victor / afaligner
📈 A forced aligner intended for synchronization of narrated text
☆93 · Updated 2 years ago
Alternatives and similar repositories for afaligner
Users who are interested in afaligner are comparing it to the libraries listed below.
- Timething is a library for aligning text transcripts with their audio recordings. ☆122 · Updated 7 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆153 · Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆80 · Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- 📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays) ☆309 · Updated last year
- Web-based editor for subtitles and transcripts ☆137 · Updated 10 months ago
- Audiobook alignment for Indigenous languages ☆40 · Updated this week
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆318 · Updated 7 months ago
- Performant and accurate speech recognition built on PyTorch ☆253 · Updated 3 years ago
- An even smaller speech recognizer / force aligner ☆34 · Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆116 · Updated 2 years ago
- A Python package for deep multilingual punctuation prediction. ☆127 · Updated 10 months ago
- British English pronunciation dictionary ☆95 · Updated 7 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated last year
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor. ☆172 · Updated 5 years ago
- ez audio transcription tool with flexible processing and post-processing options ☆154 · Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations ☆291 · Updated 2 years ago
- Pronunciation dictionaries for multiple languages ☆88 · Updated 7 years ago
- Model for recasing and repunctuating ASR transcripts ☆135 · Updated last year
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking. ☆52 · Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆170 · Updated last month
- DeepSpeech based forced alignment tool ☆238 · Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆149 · Updated last year
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet… ☆44 · Updated 4 years ago
- Desktop application for neural speech synthesis written in C++ ☆215 · Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆331 · Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆116 · Updated 2 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆87 · Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆92 · Updated last year
- Massively multilingual pronunciation mining ☆344 · Updated last month