uds-lsv / Noisy-Channel-Spell-Checker
A tool for correcting misspellings in textual input using the Noisy Channel Model.
☆11 Updated 4 years ago
Related projects
Alternatives and complementary repositories for Noisy-Channel-Spell-Checker
- BERT and ELECTRA models trained on Europeana Newspapers ☆36 Updated 2 years ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 Updated 2 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆32 Updated 5 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation ☆33 Updated 2 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst… ☆21 Updated 2 years ago
- An experimental implementation of Burrows' Delta in Python 3 ☆20 Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 Updated 3 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019 ☆25 Updated 5 years ago
- These are lists for a variety of languages containing words that are distinctive to each language. ☆34 Updated 2 years ago
- zero-vocab or low-vocab embeddings ☆17 Updated 2 years ago
- A tiny BERT for low-resource monolingual models ☆29 Updated last month
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated last year
- docker for HF wav2vec2-sprint ☆12 Updated 3 years ago
- Temporarily remove unused tokens during training to save RAM and speed up training. ☆22 Updated 4 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆15 Updated 2 weeks ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 Updated 10 months ago
- ☆17 Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 Updated last year
- Breaks a word into syllables using an LSTM-based neural network. ☆19 Updated last year
- ☆15 Updated 5 years ago
- Source code for the Apple reproduction ☆31 Updated 3 years ago
- Sequence to sequence model for Arabic punctuation prediction. ☆12 Updated 4 years ago
- Multilingual Open Text ☆25 Updated 3 weeks ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆22 Updated 3 months ago
- Post-processing OCR errors with seq2seq models ☆28 Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 Updated 3 years ago
- Tool for parsing and converting various span encoding schemes. ☆22 Updated 10 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago
- Wrapper for DKPro Core to extract linguistic information from books. ☆16 Updated 2 years ago