mrychlik / worldly-ocr
Text-to-image conversion (OCR) for Pashto and Chinese, with a view towards comprehensive, multi-lingual OCR
☆18Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for worldly-ocr
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Japanese trained data of clstm☆15Updated 8 years ago
- ☆20Updated 5 years ago
- Course in Document and Content Analysis.☆14Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 5 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016)☆13Updated 7 years ago
- Course in Natural Language Processing and Applications☆10Updated 2 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated 10 months ago
- OpenMPF core, including the Workflow Manager web application☆30Updated this week
- PAGE XML format collection for document image page content and more☆66Updated 3 years ago
- Simple CTC implementation for PyTorch☆14Updated 7 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated 8 months ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- ClickModels for Search Engines Implemented on top of Cython.☆13Updated 3 years ago
- StarGraph (aka *graph) is a graph database to query large Knowledge Graphs. Playing with Knowledge Graphs can be useful if you are develo…☆32Updated this week
- Towards an Understanding of Entity-Oriented Search Intents - ECIR'18☆9Updated 5 years ago
- ☆25Updated 6 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 8 years ago
- models trained to identify locations based on images☆19Updated 8 years ago
- A tool to get the arxiv papers☆19Updated 7 years ago
- MultiviewFaceDetection☆8Updated 9 months ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆41Updated 7 years ago
- End-2-end multi-label classification in python☆34Updated 2 years ago
- An OCR(Optical Character Recognition) System implemented to recognize the character in a resume or cv.☆18Updated 8 years ago
- Deep neural parser for database query☆19Updated 2 years ago
- ☆34Updated 7 years ago
- Generates the most important key-phrase/key-words from a document based on a corpus☆11Updated 5 months ago