FelixMohr / NLP-with-Python
Using Conditional Random Fields for segmenting Latin words written in scriptio continua
☆10Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for NLP-with-Python
- In-browser OCR of Ancient Greek and Latin☆23Updated last week
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 6 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated 10 months ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated last year
- Wrapper around pixel classifier☆9Updated 2 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Morphological analyzer and lemmatizer for Latin.☆25Updated 2 weeks ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- Topic Modeling Workflow in Python☆16Updated last year
- spaCy-to-naf converter☆21Updated 5 months ago
- ☆11Updated 2 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆38Updated 5 years ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles☆11Updated 5 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆11Updated 5 years ago
- ☆17Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Machine Learning-assisted correction of OCR errors in historical corpora☆9Updated 2 weeks ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- 🚀GUI for training spaCy models☆53Updated 3 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆41Updated 7 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- ☆22Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆39Updated last year
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Software for multi-level annotation of linguistic corpora☆17Updated 4 years ago
- ☆10Updated last year