Sofwath / thaanaOCRLinks
Customized version of Keras image_ocr and ockre generalised to handle Thaana/Dhivehi Script.
☆22Updated 8 years ago
Alternatives and similar repositories for thaanaOCR
Users that are interested in thaanaOCR are comparing it to the libraries listed below
Sorting:
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- a Deep Learning based Speller☆225Updated 7 years ago
- Arabic Text Detection in Images☆15Updated 7 years ago
- Natural language processing tools for the Dhivehi language.☆15Updated 2 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 6 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 11 months ago
- This repository provides our datasets for Arabic emotion detection in Twitter☆9Updated 7 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆57Updated 4 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆141Updated 5 years ago
- Toolbox for OCR post-correction☆121Updated 5 years ago
- CPSC 578 Semester Project☆11Updated 7 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 11 months ago
- Locate and extract tables and figures in PDFs☆42Updated 4 years ago
- Inter-annotator agreement for Doccano☆27Updated 5 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆67Updated 4 years ago
- OCR-D python tools☆33Updated 11 months ago
- ☆32Updated 6 years ago
- An unsupervised compound splitter☆41Updated 5 years ago
- A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.☆124Updated last year
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆36Updated 6 years ago
- Hotels Arabic-Reviews Dataset☆32Updated 6 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Clinical spelling correction with word and character n-gram embeddings.☆74Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago