KaniyamFoundation / Pdf2Text
Project to convert PDF files to Text files using google OCR
☆12Updated 9 months ago
Alternatives and similar repositories for Pdf2Text:
Users that are interested in Pdf2Text are comparing it to the libraries listed below
- Tamil Language words list☆10Updated 8 years ago
- Python Interface to Cologne Digital Sanskrit Lexicon (CDSL)☆13Updated 2 years ago
- An OCR for classical Sanskrit document images☆48Updated 2 years ago
- Transliteration module for Indian Languages☆77Updated last year
- Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)☆95Updated 3 months ago
- OCR for WikiSource using Google Drive OCR☆33Updated 8 months ago
- A rule-based iterative affix stripping stemmer for Tamil☆43Updated 6 years ago
- Data for the quantitative study of (Vedic) Sanskrit☆117Updated 3 months ago
- ☆32Updated 3 years ago
- Vaiyyākaraṇaḥ is a telegram bot that offers various tools for a Sanskrit learner including stem (प्रातिपदिकम्) finder, root (धातुः) finde…☆13Updated 3 months ago
- Aksharamukha Python Library☆44Updated 2 weeks ago
- sanskrit monolingual corpus☆18Updated 8 years ago
- A project to collect all tamil nouns☆10Updated 2 months ago
- The e-texts of the SARIT project☆40Updated 8 months ago
- State of the Art Language models and Classifier for Sanskrit language (ancient indian language)☆80Updated 5 years ago
- Versioned Sanskrit linguistic data☆17Updated 3 months ago
- Python package for indic script transliteration☆171Updated last month
- Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning☆18Updated 4 years ago
- Data for all dictionaries of Cologne. Now all corrections are made in this git-based workflow.☆14Updated this week
- ☆48Updated this week
- Repository to store Sanskrit koshas and scripts to process them.☆28Updated 11 months ago
- A Python based API to access Indian language WordNets.☆39Updated 2 years ago
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆9Updated 2 years ago
- தமிழில் இயல்மொழி ஆய்வுக்கான நிரல்கள், கருவிகள் மற்றும் தரவுகள்☆72Updated last month
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 3 years ago
- Parsers for Sanskrit / संस्कृतम्☆70Updated last year
- ☆20Updated this week
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆14Updated 2 years ago
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆16Updated last year
- A general-purpose Sanskrit library☆66Updated 7 years ago