MISabic / NER-Bangla-Dataset
Dataset for Bangla named entity recognition
☆7 · Updated 3 years ago
Alternatives and similar repositories for NER-Bangla-Dataset:
Users who are interested in NER-Bangla-Dataset are comparing it to the repositories listed below.
- This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for… ☆49 · Updated last year
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network ☆20 · Updated 3 years ago
- PyTorch implementation for paper 'BANNER: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition' ☆14 · Updated 4 years ago
- Resources and Tool for Bangla language computation ☆14 · Updated last year
- ☆35 · Updated last year
- ☆47 · Updated 2 years ago
- Dataset of ML and NLP papers ☆35 · Updated 2 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022 ☆14 · Updated last year
- Pretraining scripts for BART transformer model ☆11 · Updated last year
- Automatic Context-Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance ☆20 · Updated 4 months ago
- ☆14 · Updated 2 years ago
- Statistics on multilingual datasets ☆17 · Updated 2 years ago
- This is the official repository of the paper "Query Focused Abstractive Summarization via Incorporating Query Relevance and Transfer Lear… ☆16 · Updated 4 years ago
- ☆15 · Updated 3 years ago
- Zero-shot Transfer Learning from English to Arabic ☆29 · Updated 2 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4… ☆264 · Updated last year
- This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc… ☆35 · Updated 10 months ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 · Updated last year
- Code Repository for the IndicXNLI paper. ☆15 · Updated last year
- This code provides a word-level language identification tool for identifying the language of individual words in code-mixed text. e.g. The tex… ☆52 · Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 · Updated last year
- Bangla Unicode Normalization ☆19 · Updated 10 months ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Updated 6 years ago
- Conversion scripts for coreference ☆27 · Updated 5 months ago
- ☆12 · Updated 4 years ago
- Multilingual Dialogue Datasets ☆19 · Updated 2 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal… ☆81 · Updated 3 years ago
- Agile reading group that works ☆13 · Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 · Updated 2 years ago
- This repository hosts my experiments for the project I did with OffNote Labs. ☆10 · Updated 3 years ago