Felflare / rpunct
An easy-to-use package to restore punctuation of the text.
☆108 · Updated last year
Related projects
Alternatives and complementary repositories for rpunct
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆48 · Updated last year
- Support tools for punctuation and boundary detection for ASR output. ☆57 · Updated last year
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages ☆204 · Updated 3 months ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆112 · Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆72 · Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆71 · Updated 3 years ago
- A python package for deep multilingual punctuation prediction. ☆94 · Updated 2 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model ☆47 · Updated 3 years ago
- Various speech datasets made available to the public ☆98 · Updated last month
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆48 · Updated last month
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆110 · Updated 2 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O… ☆62 · Updated 8 months ago
- Punctuation restoration and spell correction experiments. ☆248 · Updated 3 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆150 · Updated 3 months ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information ☆70 · Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆31 · Updated 3 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆37 · Updated last year
- A guide to building language technology in new languages. ☆57 · Updated 2 years ago
- ☆41 · Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆106 · Updated 3 years ago
- Example code for a neural transducer model. ☆59 · Updated 8 months ago
- ☆74 · Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit. ☆42 · Updated 4 months ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆321 · Updated 5 months ago
- Bicleaner fork that uses neural networks ☆38 · Updated 3 months ago
- Wave2vec 2.0 Recognize pipeline ☆33 · Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆130 · Updated 7 months ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆85 · Updated 2 years ago
- ☆101 · Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 · Updated 2 years ago