oliverguhr / deepmultilingualpunctuationLinks
A python package for deep multilingual punctuation prediction.
β132Updated last year
Alternatives and similar repositories for deepmultilingualpunctuation
Users that are interested in deepmultilingualpunctuation are comparing it to the libraries listed below
Sorting:
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- πAn easy-to-use package to restore punctuation of the text.β118Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ221Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.β123Updated 10 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ150Updated last year
- Various speech datasets made available to the publicβ131Updated 9 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ118Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)β341Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) scriptβ225Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β327Updated 10 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesβ170Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!β176Updated this week
- β39Updated 3 years ago
- β43Updated 2 years ago
- β37Updated 5 months ago
- Model for recasing and repunctuating ASR transcriptsβ139Updated last year
- A merged version of multiple open-source German speech datasets.β33Updated last year
- Text to speech alignment using CTC forced alignmentβ366Updated last month
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decodingβ76Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ140Updated last year
- Finetune VITS and MMS using HuggingFace's toolsβ164Updated last year
- Multilingual G2P in 100 languagesβ357Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β369Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated 2 years ago
- β359Updated last year
- Grapheme to phoneme conversion with deep learning.β400Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)β95Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.β41Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated 2 weeks ago
- β133Updated last week