This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), July 9-14, 2023.
☆53Mar 26, 2024Updated last year
Alternatives and similar repositories for CrossSum
Users that are interested in CrossSum are comparing it to the libraries listed below
Sorting:
- This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced i…☆85Jul 31, 2023Updated 2 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆277Mar 26, 2024Updated last year
- ☆24Nov 27, 2021Updated 4 years ago
- This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced…☆248Jan 24, 2023Updated 3 years ago
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆36May 7, 2024Updated last year
- ☆31Apr 21, 2023Updated 2 years ago
- Bangla text corrector app.☆35Dec 31, 2024Updated last year
- Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Am…☆21Mar 7, 2024Updated last year
- Pytorch implementation for paper 'BANNER: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition'☆13Apr 15, 2020Updated 5 years ago
- ☆38Jan 13, 2024Updated 2 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆19Jun 12, 2025Updated 8 months ago
- Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.☆46Aug 13, 2021Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Mar 14, 2025Updated 11 months ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- A rule-based lemmatizer for Bengali / Bangla based written in Python. Under active development.☆25Dec 28, 2019Updated 6 years ago
- This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Da…☆152Oct 23, 2024Updated last year
- AvroPad☆28Dec 25, 2016Updated 9 years ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆36Jan 13, 2024Updated 2 years ago
- An Android app about my dept. You can find All contact information's of both teacher's and students here in this app including phone numb …☆10Jan 17, 2022Updated 4 years ago
- ☆17Sep 10, 2025Updated 5 months ago
- Compile-time string encryption and import obfuscation for Windows PE32(+) binaries☆16Jan 18, 2026Updated last month
- Machine learning algorithms implements with jax for machine learning in production in large scale dataset.☆14Updated this week
- Code + Online Judge : Coding made sohoj☆10Dec 25, 2025Updated 2 months ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆10Oct 14, 2024Updated last year
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…☆10Jul 21, 2023Updated 2 years ago
- ☆10Jun 23, 2018Updated 7 years ago
- Different bangla datasets for sentiment analysis on bangla text☆10Nov 26, 2022Updated 3 years ago
- Epub Highlighter highlights specified words in EPub w/o meaning.☆11Jul 26, 2017Updated 8 years ago
- R Interface for CrowdTangle Facebook API☆10Oct 27, 2021Updated 4 years ago
- Inspired by the Tryhackme.com Room "Python for Pentesters"☆10Jun 9, 2024Updated last year
- Dialog Acts SEGmentation: Tools for dialog act research☆14Mar 21, 2025Updated 11 months ago
- Multilingual Language Modeling Toolkit☆11May 25, 2017Updated 8 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 2 years ago
- ☆39Aug 14, 2025Updated 6 months ago
- Lightweight static website generator: low-ceremony generic file processor with proven javascript tools.☆13Feb 14, 2026Updated 2 weeks ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- Onsen UI 2.0 example with Material Design Tabbar☆12Jan 24, 2016Updated 10 years ago
- Source codes for the paper "Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization"☆91Oct 27, 2023Updated 2 years ago