edahanoam / Awesome-Summarization-Datasets
Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survey".
☆29Apr 29, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-Summarization-Datasets
Users that are interested in Awesome-Summarization-Datasets are comparing it to the libraries listed below
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- ☆14Dec 1, 2025Updated 2 months ago
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).☆63Jul 1, 2025Updated 7 months ago
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which est…☆19Jun 23, 2025Updated 7 months ago
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.☆34Jun 30, 2024Updated last year
- ☆31Apr 2, 2022Updated 3 years ago
- ☆31Apr 21, 2023Updated 2 years ago
- Local text-to-speech in your browser with Piper TTS☆16Aug 13, 2025Updated 6 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Mar 17, 2025Updated 10 months ago
- Neural embeddings with negative sampling in Keras☆11Jun 11, 2017Updated 8 years ago
- ☆14Apr 23, 2025Updated 9 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆16Jan 12, 2026Updated last month
- Python interface for the Berkeley Parser using JPype☆12Dec 18, 2015Updated 10 years ago
- ☆12Dec 14, 2024Updated last year
- Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".☆11Oct 20, 2024Updated last year
- ☆13Jul 8, 2020Updated 5 years ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11May 28, 2025Updated 8 months ago
- Human Evaluation Benchmark for Text Simplification☆10Sep 6, 2018Updated 7 years ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- A PyTorch Lightning WGAN-GP to generate faces☆11Jan 26, 2021Updated 5 years ago
- [ACL 2021 Findings] HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction☆10Sep 16, 2021Updated 4 years ago
- lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.☆13Sep 30, 2020Updated 5 years ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆12Apr 14, 2025Updated 10 months ago
- Japanese Morphological Analyzer written in pure Dart.☆13Jun 5, 2021Updated 4 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- Command-line (CLI) coffee journal designed for coffee enthusiasts. (https://codeberg.org/mrus/kopi)☆14Dec 15, 2025Updated last month
- Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach, Lazar et al., EMNLP 2021☆13Nov 10, 2022Updated 3 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- Deploy docs from your source tree to a GitHub wiki☆13Jun 14, 2023Updated 2 years ago
- hydra-pl-wandb-sample-project is NN experiment management code using hydra, pytorch-lightning, and wandb.☆11Nov 22, 2021Updated 4 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- An AI assistant that can help you with content composition right in Microsoft Word☆17Sep 10, 2024Updated last year
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 4 years ago
- Learning to Hash for Maximum Inner Product Search☆12Jan 21, 2022Updated 4 years ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod…☆16Jan 6, 2025Updated last year
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Feb 18, 2017Updated 8 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago