psankar / korkai
A corpus builder for Tamil that analyzes WordPress, Blogger, and Wikipedia dumps
☆20Updated 4 years ago
Related projects
Alternatives and complementary repositories for korkai
- Avalokitam: Tamil Prosody Analyzer☆25Updated 7 months ago
- Tamil Dictionary☆18Updated last year
- Programs, tools, and data for natural language research in Tamil☆71Updated 4 months ago
- Tamil Language words list☆10Updated 8 years ago
- The பைந்தமிழ் (pytamil) library is intended for the analysis of Tamil literary works. A wealth of knowledge is hidden in old literature.…☆51Updated 6 months ago
- Collections-Tamil-Tanslation☆25Updated last year
- A rule-based iterative affix stripping stemmer for Tamil☆43Updated 6 years ago
- Open data, code collections, and software available in Tamil.☆41Updated 7 months ago
- Open Source Tamil NLP Tools (a Tamil natural language analysis library)☆266Updated 2 weeks ago
- ☆32Updated 3 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali)☆40Updated last year
- Ezhil (எழில்): a Tamil programming language that helps Tamil students write their first computer programs (Ezhil, is a fun Tamil programming languag…☆173Updated 5 months ago
- English - Tamil Dictionary - Offline☆12Updated 8 years ago
- Python Interface to Cologne Digital Sanskrit Lexicon (CDSL)☆12Updated 2 years ago
- KNphone is a phonetic algorithm for indexing Kannada words by their pronunciation, like Metaphone for English.☆47Updated 5 years ago
- Language model for Tamil☆49Updated 3 years ago
- tamil-sandhi-checker☆53Updated 3 years ago
- ☆11Updated 5 years ago
- Python package for indic script transliteration☆165Updated last month
- Extracts all the Tamil words from the Tamil Wikipedia☆24Updated 2 months ago
- ASCII to Unicode encoding converter for Kannada.☆73Updated last year
- Hunspell Tamil spell checker☆26Updated 10 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆58Updated 3 years ago
- ☆45Updated this week
- A collection of basic text processing modules focused on Gujarati☆10Updated 7 years ago
- State-of-the-art language models and classifier for the Tamil language (spoken in India and a few other South Asian countries)☆53Updated 4 years ago
- Awesome List of Tamil NLP & AI Resources☆103Updated last year
- A Python package that implements a tokenizer and stemmer for the Hindi language☆91Updated 4 years ago
- ☆31Updated 5 years ago
- Describes the IndicNLP corpus and associated datasets☆157Updated last year