FelixMohr / NLP-with-Python
Using Conditional Random Fields for segmenting Latin words written in scriptio continua
☆10 · Updated 6 years ago
Alternatives and similar repositories for NLP-with-Python:
Users who are interested in NLP-with-Python are comparing it to the libraries listed below.
- Post-processing OCR errors with seq2seq models ☆28 · Updated 4 years ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 · Updated last year
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy. ☆22 · Updated 7 years ago
- ☆20 · Updated 5 years ago
- DFKI Layout Detection for OCR-D ☆47 · Updated 3 months ago
- ☆11 · Updated 3 years ago
- Segmenting a given document using recursive xy-cut algorithm. ☆12 · Updated 6 years ago
- Practical ML and NLP with examples. ☆34 · Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 · Updated 2 years ago
- Arabic - English emotion lexicon ☆12 · Updated 7 years ago
- Morphological analyzer and lemmatizer for Latin. ☆26 · Updated 2 weeks ago
- ☆17 · Updated 6 months ago
- Tool for sentiment analysis annotation ☆12 · Updated 4 months ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 · Updated 2 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON ☆18 · Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171. ☆15 · Updated 2 years ago
- Machine Learning-assisted correction of OCR errors in historical corpora ☆9 · Updated 3 months ago
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation ☆35 · Updated this week
- Text classification automl ☆21 · Updated 3 years ago
- Breaks a word into syllables using an LSTM-based neural network. ☆19 · Updated last year
- ☆17 · Updated last year
- This repository is about how to build an SQLite version of the Arabic WordNet database. ☆11 · Updated 5 years ago
- A collection of notebooks for Natural Language Processing ☆25 · Updated last month
- Jena Semantic Explorer ☆11 · Updated 4 months ago
- End-2-end multi-label classification in python ☆33 · Updated 2 years ago
- Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents ☆12 · Updated 2 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier. ☆26 · Updated 3 years ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles ☆11 · Updated 6 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆13 · Updated 3 years ago