oya163 / nepali-ner
Named Entity Recognition in Nepali Language
☆10Updated 2 years ago
Alternatives and similar repositories for nepali-ner:
Users that are interested in nepali-ner are comparing it to the libraries listed below
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 3 years ago
- A Python based API to access Indian language WordNets.☆37Updated 2 years ago
- Description Describes the IndicNLP corpus and associated datasets☆161Updated last year
- Hinglish Text Classification☆30Updated last year
- Fast and accurate spell correction library☆79Updated 2 years ago
- Tutorial on English to Hindi Transliteration using Seq2Seq Architecture in Tensorflow☆16Updated 5 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆122Updated last year
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆45Updated 4 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated last year
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆45Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Tutorial for first time BERT users,☆102Updated 2 years ago
- ☆107Updated last year
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆188Updated last year
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆51Updated 4 years ago
- HateEval 2019 - Task 5☆15Updated 5 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆192Updated 4 years ago
- Code for experiments done for EMNLP2020.☆11Updated 2 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆30Updated 3 years ago
- ☆16Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- State of the Art Language models and Classifier for Hindi language (spoken in Indian sub-continent)☆123Updated 4 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 5 months ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 8 months ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 8 months ago
- ☆48Updated 5 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆35Updated last year