oliverguhr / fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
☆80Updated 2 years ago
Alternatives and similar repositories for fullstop-deep-punctuation-prediction:
Users that are interested in fullstop-deep-punctuation-prediction are comparing it to the libraries listed below
- A python package for deep multilingual punctuation prediction.☆119Updated 8 months ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆212Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- Various speech datasets made available to the public☆116Updated 4 months ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆148Updated 11 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆117Updated 4 months ago
- 📝An easy-to-use package to restore punctuation of the text.☆115Updated 2 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆192Updated 8 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆49Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆158Updated this week
- ☆78Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆335Updated 11 months ago
- ☆38Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆159Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 9 months ago
- Simple Diarization model☆47Updated last year
- ☆88Updated 2 weeks ago
- ☆39Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆90Updated last year
- Multilingual G2P in 100 languages☆320Updated last year
- ☆35Updated last month
- OpenAI Whisper Prompt Examples☆52Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆62Updated 3 weeks ago
- Linguistic processing for Common Voice☆55Updated last year