notAI-tech / fastPunct
Punctuation restoration and spell correction experiments.
☆251Updated 4 years ago
Alternatives and similar repositories for fastPunct:
Users who are interested in fastPunct are comparing it to the libraries listed below:
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages☆209Updated 7 months ago
- Text and Punctuation correction with Deep Learning☆128Updated 4 years ago
- Experimental project to punctuate text using an embedding layer, a single convolutional layer and an output softmax layer, written in Keras.☆83Updated 4 years ago
- A sentence segmenter that actually works!☆305Updated 4 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆114Updated last year
- A module for normalising text.☆173Updated 3 years ago
- xfspell — the Transformer Spell Checker☆189Updated 4 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆674Updated 3 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆204Updated 3 years ago
- CMU Wilderness Multilingual Speech Dataset☆277Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆119Updated 7 months ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script☆185Updated 7 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆333Updated 10 months ago
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model☆180Updated 5 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- Fast and accurate spell correction library☆81Updated 3 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆241Updated 5 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆371Updated 3 years ago
- Complementary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- 📃Language Model based sentences scoring library☆307Updated 3 years ago
- Massively multilingual pronunciation mining☆331Updated 2 weeks ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆105Updated last month
- Model for recasing and repunctuating ASR transcripts☆133Updated 11 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆468Updated 5 years ago