KaniyamFoundation / Pdf2TextLinks
Project to convert PDF files to Text files using google OCR
☆13Updated last year
Alternatives and similar repositories for Pdf2Text
Users that are interested in Pdf2Text are comparing it to the libraries listed below
Sorting:
- OCR for WikiSource using Google Drive OCR☆34Updated last year
- Tamil Language words list☆11Updated 9 years ago
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆18Updated last year
- A project to collect all tamil nouns☆11Updated 7 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- A rule-based iterative affix stripping stemmer for Tamil☆44Updated last month
- Aksharamukha☆184Updated 3 months ago
- ☆14Updated 4 years ago
- Resources to go with the Indic NLP Library☆73Updated 3 years ago
- Data for the quantitative study of (Vedic) Sanskrit☆126Updated last week
- Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)☆98Updated 8 months ago
- Karthika - A offline Tamil Wiktionary in Python☆17Updated 13 years ago
- Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning☆19Updated 4 years ago
- Python package for indic script transliteration☆189Updated this week
- Versioned Sanskrit linguistic data☆18Updated 8 months ago
- A collection of basic text processing modules focused on Gujarati☆10Updated 7 years ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- A Directory of Online Newspaper Sources for 70+ Languages☆32Updated 4 years ago
- A context-based spellchecker for correcting OCR output.☆20Updated 2 years ago
- Transliteration module for Indian Languages☆78Updated last year
- Script to compile and install Tamil TTS system from IITM donlab☆31Updated 6 years ago
- An OCR for classical Sanskrit document images☆52Updated 2 years ago
- தமிழில் இயல்மொழி ஆய்வுக்கான நிரல்கள், கருவிகள் மற்றும் தரவுகள்☆73Updated 4 months ago
- The e-texts of the SARIT project☆40Updated last month
- Data powering ashtadhyayi.com☆49Updated last month
- Automated paraphrases Generation☆36Updated 2 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Updated 8 years ago
- Align various Sanskrit texts and audio☆15Updated last week
- Parsers for Sanskrit / संस्कृतम्☆76Updated last year