FelixMohr / NLP-with-PythonLinks
Using Conditional Random Fields for segmenting Latin words written in scriptio continua
☆10Updated 7 years ago
Alternatives and similar repositories for NLP-with-Python
Users that are interested in NLP-with-Python are comparing it to the libraries listed below
Sorting:
- Post-processing OCR errors with seq2seq models☆28Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated 2 years ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles☆11Updated 7 years ago
- DFKI Layout Detection for OCR-D☆47Updated 9 months ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Quill's library of open source NLP algorithms and data sets.☆52Updated last year
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 7 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- ☆10Updated 2 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 3 years ago
- Tensorflow Implementation of FaceNet: A Unified Embedding for Face Recognition and Clustering to find the celebrity whose face matches th…☆31Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- Deploy Pytorch models to production via panini☆10Updated 6 years ago
- Text classification automl☆21Updated 4 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Updated 6 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- Toolbox for OCR post-correction☆122Updated 6 years ago
- Deep neural parser for database query☆18Updated 3 years ago
- Arabic Text Detection in Images☆15Updated 7 years ago
- ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data☆27Updated 6 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆61Updated 6 years ago
- Demonstration of using Caffe2 inside an Android application.☆10Updated 7 years ago
- An open source, API centric chat bot platform in Django Rest framework developed with python chatterbot and NLTK packages.☆11Updated 2 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks.☆39Updated 5 years ago
- ☆17Updated last year
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 9 months ago
- Photos and artwork images with object annotations for academic use only☆28Updated 9 years ago