A Python package that implements a tokenizer and a stemmer for the Hindi language
☆95Oct 2, 2020Updated 5 years ago
Alternatives and similar repositories for hindi-tokenizer
Users that are interested in hindi-tokenizer are comparing it to the libraries listed below.
- A Python based API to access Indian language WordNets.☆38Apr 26, 2022Updated 3 years ago
- Resources and tools for Indian language Natural Language Processing☆632Jun 7, 2024Updated last year
- Semi-supervised POS tagger for Sanskrit☆10Aug 22, 2016Updated 9 years ago
- Collects product data from bigbasket.com☆11Apr 8, 2017Updated 8 years ago
- A text summarization tool for Marathi, implemented as a project for the course Advanced NLP (CSCI 544)☆16Apr 29, 2017Updated 8 years ago
- Sanskrit compound segmentation using seq2seq model☆26Sep 29, 2018Updated 7 years ago
- A collection of basic text processing modules focused on Gujarati☆10Oct 24, 2017Updated 8 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆38Feb 17, 2015Updated 11 years ago
- State-of-the-Art Language Modeling and Text Classification in Hindi Language☆219Mar 9, 2019Updated 7 years ago
- A rule-based lemmatizer for Bengali / Bangla, written in Python. Under active development.☆25Dec 28, 2019Updated 6 years ago
- A collaborative catalog of NLP resources for Indic languages☆629Dec 14, 2024Updated last year
- Hindi NLP work☆14Apr 4, 2022Updated 3 years ago
- Solutions for various datasets and contests on Kaggle☆14Oct 22, 2018Updated 7 years ago
- A repository containing the details of a natural language inference dataset in Hindi☆14Dec 28, 2020Updated 5 years ago
- The project aims to add a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆274Oct 28, 2022Updated 3 years ago
- YOLO Algorithm (Yolov2 model) trained on COCO Dataset for Object Detection☆26Nov 22, 2019Updated 6 years ago
- ☆13Apr 12, 2024Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆206May 27, 2020Updated 5 years ago
- ACL Rolling Review website☆11Updated this week
- The repository contains all the code necessary for my project: Automatic Speech Recognition System in Hindi Language (Project descript…☆28Nov 29, 2019Updated 6 years ago
- Experiments for recognising textual entailment☆14Oct 12, 2012Updated 13 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- A Python library for easily querying morphological inflection models trained on UniMorph☆13Oct 23, 2022Updated 3 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- A quick Tensorflow implementation of Facebook FairSeq[1] for character-level neural machine translation (EN -> JP).☆14May 5, 2018Updated 7 years ago
- 🗺️ OpenStreetMap Countries GeoJSON — updated daily!☆18Aug 17, 2025Updated 7 months ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆32Nov 30, 2016Updated 9 years ago
- ☆27Jan 7, 2017Updated 9 years ago
- State-of-the-art language models and classifier for the Hindi language (spoken in the Indian subcontinent)☆123Aug 5, 2020Updated 5 years ago
- A stream to RTL compiler based on MLIR and CIRCT☆16Nov 15, 2022Updated 3 years ago
- A simple Python script that emulates the experiment in Tomas Mikolov's paper "Exploiting Similarities among Languages for Machine Transla…☆14Oct 4, 2015Updated 10 years ago
- ☆11Feb 8, 2022Updated 4 years ago
- A tokenizer for Icelandic text.☆30Dec 12, 2025Updated 3 months ago
- This repository is for downloading the SEMCAT dataset 2018 for the publication "Senel L. K., Utlu I., Yucesoy V., Koc A., Cukur T., Semantic S…☆10Sep 18, 2020Updated 5 years ago
- SyPhon: Constraint-based Learning of Phonological Rules☆11Mar 5, 2025Updated last year
- Course code for my Deep Learning Computer Vision Course☆25Mar 24, 2020Updated 6 years ago
- Definitive Screening design of experiments☆13May 28, 2024Updated last year
- A visualisation tool for spaCy using Hierplane.☆64Jan 25, 2023Updated 3 years ago
- This repository makes the integral Let's Go dataset publicly available.☆45Jun 15, 2023Updated 2 years ago