notAI-tech / fastPunct
Punctuation restoration and spell correction experiments.
☆248Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for fastPunct
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆204Updated 3 months ago
- A sentence segmenter that actually works!☆302Updated 4 years ago
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- A module for normalising text.☆173Updated 3 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆108Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆74Updated last year
- xfspell — the Transformer Spell Checker☆187Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆112Updated 5 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆663Updated 3 years ago
- DeepSpeech based forced alignment tool☆235Updated 3 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆179Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- Massively multilingual pronunciation mining☆321Updated this week
- Universal Romanizer that can convert any unicode script to roman (latin) script☆154Updated 3 months ago
- 📃Language Model based sentences scoring library☆303Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆321Updated 6 months ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆225Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆153Updated 4 years ago
- A python package for deep multilingual punctuation prediction.☆98Updated 3 months ago
- Grapheme to phoneme conversion with deep learning.☆358Updated 11 months ago
- CMU Wilderness Multilingual Speech Dataset☆272Updated 5 years ago
- a pytorch implementation of auto-punctuation learned character by character☆141Updated 4 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆671Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆512Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆221Updated 3 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆352Updated 3 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆103Updated 2 weeks ago