KaniyamFoundation / Pdf2Text
Project to convert PDF files to Text files using google OCR
☆12Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for Pdf2Text
- Tamil Language words list☆10Updated 8 years ago
- OCR for WikiSource using Google Drive OCR☆33Updated 5 months ago
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆15Updated 11 months ago
- தமிழில் இயல்மொழி ஆய்வுக்கான நிரல்கள், கருவிகள் மற்றும் தரவுகள்☆71Updated 4 months ago
- The e-texts of the SARIT project☆39Updated 5 months ago
- Data for the quantitative study of (Vedic) Sanskrit☆111Updated 3 weeks ago
- Transliteration module for Indian Languages☆77Updated last year
- Python Interface to Cologne Digital Sanskrit Lexicon (CDSL)☆12Updated 2 years ago
- A rule-based iterative affix stripping stemmer for Tamil☆43Updated 6 years ago
- Resources to go with the Indic NLP Library☆72Updated 2 years ago
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆9Updated 2 years ago
- ☆45Updated this week
- Versioned Sanskrit linguistic data☆17Updated 2 weeks ago
- Aksharamukha☆161Updated 3 months ago
- A collection of basic text processing modules focused on Gujarati☆10Updated 7 years ago
- sanskrit monolingual corpus☆18Updated 7 years ago
- Repository to store Sanskrit koshas and scripts to process them.☆25Updated 8 months ago
- Create a web service for step by step derivation of verb forms of Sanskrit language.☆16Updated 10 months ago
- A general-purpose Sanskrit library☆63Updated 6 years ago
- Parsers for Sanskrit / संस्कृतम्☆69Updated last year
- A Python based API to access Indian language WordNets.☆37Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆58Updated 3 years ago
- Data powering ashtadhyayi.com☆42Updated this week
- ☆31Updated 5 years ago
- Verb declention for Sanskrit☆41Updated last year
- Aksharamukha Python Library☆43Updated last month
- Data for all dictionaries of Cologne. Now all corrections are made in this git-based workflow.☆14Updated this week
- An OCR for classical Sanskrit document images☆44Updated last year
- Stardict dictionary files for the Sanskrit language.☆76Updated this week
- Python package for indic script transliteration☆165Updated last month