Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survey".
☆30Apr 29, 2025Updated 10 months ago
Alternatives and similar repositories for Awesome-Summarization-Datasets
Users that are interested in Awesome-Summarization-Datasets are comparing it to the libraries listed below
Sorting:
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- ☆14Dec 1, 2025Updated 3 months ago
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).☆63Jul 1, 2025Updated 8 months ago
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆23Oct 19, 2025Updated 4 months ago
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning☆17Jan 21, 2025Updated last year
- A Pytorch Lightning WGAN-gp to generate faces☆11Jan 26, 2021Updated 5 years ago
- Neural embeddings with negative sampling in Keras☆11Jun 11, 2017Updated 8 years ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆16Jan 12, 2026Updated last month
- ☆10Jun 13, 2020Updated 5 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- Human Evaluation Benchmark for Text Simplification☆10Sep 6, 2018Updated 7 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- ☆14Apr 23, 2025Updated 10 months ago
- Python interface for the Berkeley Parser using JPype☆12Dec 18, 2015Updated 10 years ago
- Local text-to-speech in your browser with Piper TTS☆17Aug 13, 2025Updated 6 months ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".☆13Oct 20, 2024Updated last year
- hydra-pl-wandb-sample-project is a NN experiment management code using hydra, pytorch-lightinig, and wandb.☆11Nov 22, 2021Updated 4 years ago
- Filling the Gaps in Ancient Akkadian Texts:A Masked Language Modelling Approach, Lazar et al., EMNLP 2021☆13Nov 10, 2022Updated 3 years ago
- Deploy docs from your source tree to a GitHub wiki☆13Jun 14, 2023Updated 2 years ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- Emergent Communication Pretraining for Few-Shot Machine Translation☆13Dec 3, 2020Updated 5 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- Japanese Morphological Analyzer written in pure Dart.☆13Jun 5, 2021Updated 4 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 8 months ago
- A Chainer implementation of doc2vec☆10Nov 16, 2017Updated 8 years ago
- Tools for working with QA-SRL data and annotating it with crowdsourcing.☆12Sep 22, 2023Updated 2 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- Dependency-Based Self-Attention for Transformer NMT☆12Mar 27, 2024Updated last year
- Learning to Hash for Maximum Inner Product Search☆12Jan 21, 2022Updated 4 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 4 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 3 years ago
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.☆12Mar 19, 2021Updated 4 years ago
- ☆14Jun 25, 2025Updated 8 months ago
- ☆15Feb 24, 2021Updated 5 years ago
- The official repo for “Semantic-guided Semantic Scene Completion”☆17Jul 18, 2024Updated last year