FelixMohr / NLP-with-Python
Using Conditional Random Fields for segmenting Latin words written in scriptio continua
☆10 Updated 7 years ago
Alternatives and similar repositories for NLP-with-Python
Users who are interested in NLP-with-Python are comparing it to the libraries listed below.
- Post-processing OCR errors with seq2seq models ☆28 Updated 4 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy. ☆22 Updated 7 years ago
- Wrapper around pixel classifier ☆9 Updated 3 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models. ☆10 Updated 7 years ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 Updated last year
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 Updated 6 years ago
- ☆23 Updated 5 years ago
- ☆17 Updated last year
- In-browser OCR of Ancient Greek and Latin ☆26 Updated last month
- ☆11 Updated 3 years ago
- Featurize words into orthographic and phonological vectors. ☆41 Updated 2 years ago
- Morphological analyzer and lemmatizer for Latin. ☆27 Updated 4 months ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 Updated 2 years ago
- Dictionaries of names, surnames, acronyms and its extensions, stop-words, etc., which I gathered for different experiments. ☆28 Updated 8 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database. ☆10 Updated 6 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated last month
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Arabic Text Detection in Images ☆15 Updated 7 years ago
- Text classification AutoML ☆21 Updated 3 years ago
- Segmenting a given document using recursive xy-cut algorithm. ☆12 Updated 6 years ago
- ☆10 Updated 6 years ago
- English web corpus with 4M tokens and several annotation types ☆26 Updated last year
- 🚀 GUI for training spaCy models ☆55 Updated 4 years ago
- Generate variations of text through synonym matching ☆12 Updated 7 years ago
- ☆17 Updated 9 months ago
- A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, N… ☆26 Updated 2 years ago
- A Python module to process data for Frame Semantic Parsing ☆24 Updated 4 years ago
- ☆10 Updated 5 years ago
- Machine Learning-assisted correction of OCR errors in historical corpora ☆9 Updated 7 months ago
- This repo contains a collection of various mini projects. ☆13 Updated 6 years ago