KaniyamFoundation / Pdf2Text
Project to convert PDF files to Text files using google OCR
☆12Updated 10 months ago
Alternatives and similar repositories for Pdf2Text:
Users that are interested in Pdf2Text are comparing it to the libraries listed below
- Vaiyyākaraṇaḥ is a telegram bot that offers various tools for a Sanskrit learner including stem (प्रातिपदिकम्) finder, root (धातुः) finde…☆13Updated 4 months ago
- Transliteration module for Indian Languages☆78Updated last year
- sanskrit monolingual corpus☆19Updated 8 years ago
- Python Interface to Cologne Digital Sanskrit Lexicon (CDSL)☆14Updated 2 years ago
- A general-purpose Sanskrit library☆66Updated 7 years ago
- Resources to go with the Indic NLP Library☆73Updated 2 years ago
- ☆22Updated last week
- Versioned Sanskrit linguistic data☆17Updated 4 months ago
- Data for the quantitative study of (Vedic) Sanskrit☆118Updated 5 months ago
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆17Updated last year
- Tamil Language words list☆11Updated 8 years ago
- Align various Sanskrit texts and audio☆14Updated last year
- Parsers for Sanskrit / संस्कृतम्☆71Updated last year
- Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning☆19Updated 4 years ago
- A rule-based iterative affix stripping stemmer for Tamil☆44Updated 6 years ago
- Hindi wordlists, dictionary and affix files in hunspell format☆40Updated 4 years ago
- Various commentaries on Ashtadhyayi of Panini.☆25Updated last week
- Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)☆95Updated 5 months ago
- Python package for indic script transliteration☆175Updated this week
- An OCR for classical Sanskrit document images☆48Updated 2 years ago
- Sanskrit compound segmentation using seq2seq model☆25Updated 6 years ago
- Repository to store Sanskrit koshas and scripts to process them.☆28Updated last month
- Toolkit for manipulating Sanskrit text with Python☆14Updated 3 months ago
- OCR for WikiSource using Google Drive OCR☆33Updated 9 months ago
- The e-texts of the SARIT project☆40Updated 10 months ago
- தமிழில் இயல்மொழி ஆய்வுக்கான நிரல்கள், கருவிகள் மற்றும் தரவுகள்☆73Updated 3 weeks ago
- ☆48Updated this week
- Aksharamukha Python Library☆44Updated last month
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆9Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 3 years ago