Sofwath / thaanaOCR
Customized version of Keras image_ocr and ockre generalised to handle Thaana/Dhivehi Script.
☆21Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for thaanaOCR
- Natural language processing tools for the Dhivehi language.☆15Updated 2 years ago
- Demo code for the pre-trained TTS system☆17Updated 3 years ago
- Labeled segmentation for the document structure of printed books☆13Updated 7 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 7 years ago
- PHP, JS script to transliterate thaana (dhivehi)☆21Updated 2 years ago
- The NLG tool for Finnish☆22Updated 11 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 4 months ago
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆140Updated 4 years ago
- Toolbox for OCR post-correction☆123Updated 5 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆15Updated last year
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆101Updated 4 years ago
- OCR-D python tools☆33Updated 2 months ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 6 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆56Updated 3 years ago
- Labeled examples from wiki dumps in Python☆68Updated 8 years ago
- Arabic data☆10Updated this week
- BML as a console application.☆9Updated 3 years ago
- A dataset of handwritten and computer generated Thaana glyphs☆11Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated last year
- Arabic Text Detection in Images☆15Updated 6 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆38Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/☆17Updated 6 years ago
- ☆16Updated 3 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 6 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆180Updated this week