KaniyamFoundation / Pdf2TextLinks
Project to convert PDF files to Text files using google OCR
☆13Updated last year
Alternatives and similar repositories for Pdf2Text
Users that are interested in Pdf2Text are comparing it to the libraries listed below
Sorting:
- Tamil Language words list☆11Updated 9 years ago
 - A project to collect all tamil nouns☆11Updated 10 months ago
 - This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
 - OCR for WikiSource using Google Drive OCR☆34Updated last year
 - Data for the quantitative study of (Vedic) Sanskrit☆136Updated 2 months ago
 - The e-texts of the SARIT project☆40Updated 4 months ago
 - ☆14Updated 4 years ago
 - Aksharamukha☆193Updated 7 months ago
 - A rule-based iterative affix stripping stemmer for Tamil☆44Updated 5 months ago
 - Transliteration module for Indian Languages☆79Updated last week
 - ☆32Updated 4 years ago
 - A cloud-based, open-source system for writing and publishing dictionaries.☆95Updated last year
 - Simple Python GUI Tool for Tesseract4☆15Updated 5 years ago
 - Parsers for Sanskrit / संस्कृतम्☆82Updated last month
 - Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning☆19Updated 4 years ago
 - Resources to go with the Indic NLP Library☆76Updated 3 years ago
 - Vaiyyākaraṇaḥ is a telegram bot that offers various tools for a Sanskrit learner including stem (प्रातिपदिकम्) finder, root (धातुः) finde…☆14Updated 11 months ago
 - Hindi wordlists, dictionary and affix files in hunspell format☆40Updated 4 years ago
 - A collection of basic text processing modules focused on Gujarati☆10Updated 8 years ago
 - A Python based API to access Indian language WordNets.☆38Updated 3 years ago
 - An OCR for classical Sanskrit document images☆53Updated 2 years ago
 - ThamizhiMorph: A Tamil Morphological Analyser and Generator☆20Updated last year
 - Automation of google ocr through gdcmdtools library.☆22Updated 7 years ago
 - Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 4 years ago
 - Versioned Sanskrit linguistic data☆18Updated 11 months ago
 - Data powering ashtadhyayi.com☆52Updated last week
 - ☆30Updated 6 years ago
 - Stardict dictionary files for the Sanskrit language.☆87Updated 2 months ago
 - Python package for indic script transliteration☆196Updated last month
 - தமிழில் இயல்மொழி ஆய்வுக்கான நிரல்கள், கருவிகள் மற்றும் தரவுகள்☆74Updated 8 months ago