csebuetnlp / CrossSum
This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), July 9-14, 2023.
☆50Updated last year
Alternatives and similar repositories for CrossSum:
Users that are interested in CrossSum are comparing it to the libraries listed below
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆267Updated last year
- ☆47Updated 2 years ago
- Dataset for Bangla named entity recognition☆7Updated 3 years ago
- This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced…☆241Updated 2 years ago
- This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced i…☆83Updated last year
- Bangla Unicode Normalization☆19Updated 11 months ago
- ☆35Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow.☆90Updated last month
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆35Updated 11 months ago
- ☆27Updated 4 months ago
- ☆15Updated 6 years ago
- Bangla-Bert is a pretrained bert model for Bengali language☆78Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Updated 3 years ago
- Ranking of Top Institutes for Natural Language Processing (NLP)☆22Updated 5 years ago
- Pytorch implementation for paper 'BANNER: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition'☆14Updated 5 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated last year
- Resources and Tool for Bangla language computation☆14Updated last year
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆79Updated 2 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network☆20Updated 3 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆50Updated 4 years ago
- Code for HypMix EMNLP 2021 (main)☆24Updated 3 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆23Updated 2 years ago
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Updated 4 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆67Updated 3 years ago
- ☆71Updated 3 years ago
- The code repository for NAACL 2021 paper "AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization".☆34Updated 3 years ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆139Updated 2 years ago
- This is the repository of Heterogeneous Transformer with Sparse Attention forLong-Text Extractive Summarization☆16Updated 3 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago