tshrinivasan / google-ocr-pythonLinks
Automation of google ocr through gdcmdtools library.
☆22Updated 7 years ago
Alternatives and similar repositories for google-ocr-python
Users that are interested in google-ocr-python are comparing it to the libraries listed below
Sorting:
- OCR for WikiSource using Google Drive OCR☆34Updated last year
 - Project to convert PDF files to Text files using google OCR☆13Updated last year
 - Simple Python GUI Tool for Tesseract4☆15Updated 5 years ago
 - Transliteration module for Indian Languages☆79Updated last week
 - To get all the tamil words from the tamil wikipedia☆25Updated last year
 - A Python based API to access Indian language WordNets.☆38Updated 3 years ago
 - Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string compari…☆58Updated 6 years ago
 - An expandable and scalable OCR pipeline☆88Updated 7 years ago
 - Hindi wordlists, dictionary and affix files in hunspell format☆40Updated 4 years ago
 - Resources to go with the Indic NLP Library☆76Updated 3 years ago
 - Versioned Sanskrit linguistic data☆18Updated 11 months ago
 - Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 4 years ago
 - Python package for indic script transliteration☆196Updated last month
 - Ocular is a state-of-the-art historical OCR system.☆265Updated last year
 - Apply different text recognition services to images of handwritten documents.☆187Updated 2 years ago
 - An OCR for classical Sanskrit document images☆53Updated 2 years ago
 - Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆42Updated 2 years ago
 - Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusCore☆37Updated 5 years ago
 - Malayalam Morphological Analyzer using Finite State Transducer☆63Updated 9 months ago
 - Upload files to Wikimedia Commons. The Spreadsheet Way.☆61Updated 2 years ago
 - The CIS OCR PostCorrectionTool☆44Updated 2 years ago
 - A radio for Wikimedia Commons audio files☆14Updated 4 years ago
 - Laws of India in Akoma Ntoso XML format☆37Updated 6 years ago
 - This buckwalter2unicode script is designed to convert Arabic text that has been transliterated to ASCII symbols using the Buckwalter Tran…☆13Updated 13 years ago
 - The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆272Updated 3 years ago
 - Data for the quantitative study of (Vedic) Sanskrit☆136Updated 2 months ago
 - A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 11 years ago
 - Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆195Updated 5 months ago
 - Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
 - A Directory of Online Newspaper Sources for 70+ Languages☆31Updated 4 years ago