uds-lsv / Noisy-Channel-Spell-Checker
A tool for correcting misspellings in textual input using the Noisy Channel Model.
☆11 · Updated 4 years ago
Alternatives and similar repositories for Noisy-Channel-Spell-Checker:
Users who are interested in Noisy-Channel-Spell-Checker compare it to the libraries listed below.
- OCR-D post-correction module based on weighted finite-state transducers ☆11 · Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 · Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 · Updated 4 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆33 · Updated 10 months ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆22 · Updated 8 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 2 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst… ☆22 · Updated 3 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 · Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆35 · Updated 2 months ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 · Updated 3 years ago
- Featurize words into orthographic and phonological vectors. ☆40 · Updated last year
- Deep neural approach to Boundary and Disfluency Detection - Based on my Master's work ☆19 · Updated 9 months ago
- Multilingual Open Text ☆25 · Updated 5 months ago
- Spell checker using Brill and Moore's noisy channel error model ☆11 · Updated 6 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆13 · Updated 3 years ago
- List of corpora annotated for coreference for different languages ☆17 · Updated 8 months ago
- A tiny BERT for low-resource monolingual models ☆31 · Updated 6 months ago
- Tool for parsing and converting various span encoding schemes. ☆23 · Updated last year
- OCRopus model for Gothic print (Fraktur) ☆18 · Updated 5 years ago
- A Python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer). ☆14 · Updated 10 months ago
- KenLM extension for spaCy 2.0. ☆16 · Updated 7 years ago
- Temporarily remove unused tokens during training to save RAM and improve speed. ☆22 · Updated last week
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki ☆23 · Updated 2 months ago
- Two-Step Approach to OCR Post-Correction ☆14 · Updated 11 months ago
- Breaks a word into syllables using an LSTM-based neural network. ☆19 · Updated last year
- Finite-state script normalization and processing utilities ☆39 · Updated last month