notAI-tech / fastPunct
Punctuation restoration and spell correction experiments.
☆246 · Updated 3 years ago
Related projects:
- A sentence segmenter that actually works! ☆303 · Updated 4 years ago
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages ☆203 · Updated last month
- Text and Punctuation correction with Deep Learning ☆129 · Updated 4 years ago
- Experimental project to punctuate text using an embedding layer, a single convolutional layer and an output softmax layer, written in Keras. ☆83 · Updated 3 years ago
- 📝 An easy-to-use package to restore punctuation of the text. ☆107 · Updated last year
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed) ☆338 · Updated 3 years ago
- xfspell: the Transformer Spell Checker ☆186 · Updated 4 years ago
- A module for normalising text. ☆172 · Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆319 · Updated 4 months ago
- A Python package for deep multilingual punctuation prediction. ☆87 · Updated last month
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆660 · Updated 3 years ago
- 📃 Language Model based sentences scoring library ☆300 · Updated 2 years ago
- CMU Wilderness Multilingual Speech Dataset ☆272 · Updated 5 years ago
- DeepSpeech based forced alignment tool ☆232 · Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆112 · Updated 5 years ago
- Grapheme to phoneme conversion with deep learning. ☆349 · Updated 9 months ago
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model ☆180 · Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆70 · Updated last year
- Massively multilingual pronunciation mining ☆315 · Updated 2 weeks ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆508 · Updated last year
- ✔️ Contextual word checker for better suggestions ☆405 · Updated 6 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆229 · Updated 4 years ago
- A fast and lightweight Python-based CTC beam search decoder for speech recognition. ☆421 · Updated last year
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆145 · Updated last month
- Support tools for punctuation and boundary detection for ASR output. ☆57 · Updated last year
- Text2Text: Crosslingual NLP/G toolkit ☆283 · Updated this week
- A PyTorch implementation of auto-punctuation learned character by character ☆141 · Updated 3 years ago
- Automatically constructing a corpus for automatic speech recognition from YouTube videos ☆151 · Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆70 · Updated 2 years ago
- Fast and accurate spell correction library ☆74 · Updated 2 years ago