KaniyamFoundation / Pdf2Text
Project to convert PDF files to text files using Google OCR
☆13Updated last year
Alternatives and similar repositories for Pdf2Text
Users who are interested in Pdf2Text are comparing it to the repositories listed below
- Tamil Language words list☆11Updated 9 years ago
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆20Updated last year
- A project to collect all Tamil nouns☆11Updated 8 months ago
- OCR for WikiSource using Google Drive OCR☆34Updated last year
- Data for the quantitative study of (Vedic) Sanskrit☆128Updated 2 weeks ago
- A Python based API to access Indian language WordNets.☆38Updated 3 years ago
- A collection of basic text processing modules focused on Gujarati☆10Updated 7 years ago
- The e-texts of the SARIT project☆40Updated 2 months ago
- Resources to go with the Indic NLP Library☆75Updated 3 years ago
- An OCR for classical Sanskrit document images☆52Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- ☆33Updated 4 years ago
- Parsers for Sanskrit / संस्कृतम्☆78Updated 3 weeks ago
- ☆14Updated 4 years ago
- ☆24Updated last week
- A rule-based iterative affix stripping stemmer for Tamil☆44Updated 3 months ago
- Aksharamukha☆189Updated 5 months ago
- Transliteration module for Indian Languages☆79Updated last year
- Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning☆19Updated 4 years ago
- A general-purpose Sanskrit library☆67Updated 7 years ago
- Stardict dictionary files for the Sanskrit language.☆86Updated last week
- Data powering ashtadhyayi.com☆51Updated this week
- Simple Python GUI Tool for Tesseract4☆15Updated 5 years ago
- Hindi wordlists, dictionary and affix files in hunspell format☆39Updated 4 years ago
- Python package for indic script transliteration☆192Updated last week
- Automation of Google OCR through the gdcmdtools library.☆22Updated 7 years ago
- Karthika - An offline Tamil Wiktionary in Python☆17Updated 13 years ago
- A collection of sentence-level aligned Sanskrit-Tibetan e-texts.☆15Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated 2 years ago
- Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)☆102Updated 10 months ago