notAI-tech / fastPunctLinks
Punctuation restoration and spell correction experiments.
β252Updated 4 years ago
Alternatives and similar repositories for fastPunct
Users that are interested in fastPunct are comparing it to the libraries listed below
Sorting:
- πAn easy-to-use package to restore punctuation of the text.β119Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ223Updated last year
- A sentence segmenter that actually works!β304Updated 5 years ago
- Text and Punctuation correction with Deep Learningβ128Updated 5 years ago
- A module for normalising text.β173Updated 4 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.β83Updated 5 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT modelsβ50Updated 2 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2β115Updated 6 years ago
- xfspell β the Transformer Spell Checkerβ189Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β83Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ157Updated 5 years ago
- Support tools for punctuation and boundary detection for ASR output.β56Updated 3 years ago
- DeepSpeech based forced alignment toolβ239Updated 4 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)β392Updated 4 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) scriptβ232Updated last year
- A python package for deep multilingual punctuation prediction.β151Updated last year
- Python library for converting numbers to words for all Indian Languages.β37Updated 6 months ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.β78Updated 9 years ago
- πLanguage Model based sentences scoring libraryβ308Updated 3 years ago
- CMU Wilderness Multilingual Speech Datasetβ288Updated 6 years ago
- Segment an audio file and obtain utterance alignments. (Python package)β343Updated last year
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained modelβ181Updated 6 years ago
- A tool for automatic phoneme transcriptionβ159Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ682Updated 4 years ago
- Text2Text Language Modeling Toolkitβ303Updated 10 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Modelβ47Updated 4 years ago
- Massively multilingual pronunciation miningβ357Updated 3 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ232Updated 4 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β130Updated 4 years ago