edahanoam/Awesome-Summarization-Datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/edahanoam/Awesome-Summarization-Datasets)

edahanoam / Awesome-Summarization-Datasets

Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survey".

☆31

Alternatives and similar repositories for Awesome-Summarization-Datasets

Users that are interested in Awesome-Summarization-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eliahuhorwitz / MoTHer
View on GitHub
Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).
☆62Jul 1, 2025Updated last year
SLAB-NLP / Multi-Prompt-LLM-Evaluation
View on GitHub
State of What Art? A Call for Multi-Prompt LLM Evaluation
☆16Apr 10, 2026Updated 3 months ago
nitzanlab / Annotatability
View on GitHub
Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which est…
☆19Jun 23, 2025Updated last year
niveck / LLMafia
View on GitHub
Asynchronous LLM Agent playing games of Mafia against human players
☆23Nov 12, 2025Updated 8 months ago
assafbk / OPRM
View on GitHub
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18Jul 8, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lt-asset / Waffle
View on GitHub
For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…
☆12May 28, 2025Updated last year
schwartz-lab-NLP / Tokens2Words
View on GitHub
☆16Apr 2, 2025Updated last year
evidencebp / commit-classification
View on GitHub
☆20Apr 24, 2025Updated last year
nfriedri / annie-annotation-platform
View on GitHub
☆31Apr 2, 2022Updated 4 years ago
LingxiaoShawn / GLAM
View on GitHub
Source Code for Graph Anomaly Detection with Unsupervised GNNs (ICDM2022)
☆11Oct 18, 2022Updated 3 years ago
s-lilo / brat-peek
View on GitHub
Framework for working with brat-annotated .ann files
☆10Mar 16, 2026Updated 4 months ago
ikergarcia1996 / T-Projection
View on GitHub
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆13Nov 21, 2023Updated 2 years ago
ikergarcia1996 / Sequence-Labeling-LLMs
View on GitHub
The code to perform Sequence Labelling with LLMs, including T5, FLAN, LLaMA, Alpaca and more!
☆14Nov 5, 2024Updated last year
jchook / wordseg
View on GitHub
Fast word segmentation with a focus on splitting #hashtags
☆14Sep 29, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
danielvarab / massive-summ
View on GitHub
☆31Apr 21, 2023Updated 3 years ago
dimalik / prediction_error
View on GitHub
Neural embeddings with negative sampling in Keras
☆11Jun 11, 2017Updated 9 years ago
suamin / MedDistant19
View on GitHub
MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)
☆19Oct 13, 2022Updated 3 years ago
lalalamdbf / PLSE_IDRR
View on GitHub
The Code for the EMNLP 2023 main conference paper "Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition…
☆13Dec 10, 2023Updated 2 years ago
eliorsulem / simplification-acl2018
View on GitHub
Human Evaluation Benchmark for Text Simplification
☆10Sep 6, 2018Updated 7 years ago
fajri91 / NeuralRST-TopDown
View on GitHub
EACL 2021
☆11May 4, 2021Updated 5 years ago
tkutschbach / RST-Tace
View on GitHub
A tool for automatic comparison and evaluation of RST trees
☆12Apr 10, 2025Updated last year
jonkahana / CLIPPR
View on GitHub
An official PyTorch implementation for CLIPPR
☆31Jul 22, 2023Updated 3 years ago
timjogorman / Multisentence-AMR-guidelines
View on GitHub
Guidelines for our secondary layer of annotation adding multi-sentence AMR links
☆12Sep 6, 2017Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
GaoleMeng / ActiveLearningAnnotationTool
View on GitHub
An active annotation tool based on brat(https://github.com/nlplab/brat)
☆19Aug 22, 2017Updated 8 years ago
eliahuhorwitz / Conffusion
View on GitHub
Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.
☆144Nov 27, 2022Updated 3 years ago
DCSaunders / gender-debias
View on GitHub
Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…
☆13Mar 18, 2021Updated 5 years ago
dialogue-evaluation / RuCoCo-2023
View on GitHub
Russian coreference resolution competition
☆11Mar 24, 2023Updated 3 years ago
writing-assistant / writing-assistant.github.io
View on GitHub
☆18Sep 3, 2024Updated last year
JL-sky / muduoChatServer
View on GitHub
基于c++ muduo网络库的集群聊天服务器，使用nginx实现负载均衡，使用reids消息队列实现跨服务器通信
☆13Feb 23, 2024Updated 2 years ago
parry2403 / R2N2
View on GitHub
RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis
☆12Sep 2, 2015Updated 10 years ago
fabrahman / char-centric-story
View on GitHub
Codebase for character-centric story understanding
☆14Jan 20, 2022Updated 4 years ago
deeppavlov / dp-dream-demos
View on GitHub
Most basic AI Assistant demo derived from the DeepPavlov Dream AI Assistant.
☆14May 22, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
MilaNLProc / language-invariant-properties
View on GitHub
☆22Mar 31, 2022Updated 4 years ago
jacklxc / StandAloneSpellingCorrection
View on GitHub
Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"
☆18Dec 21, 2020Updated 5 years ago
ethz-spylab / satml-llm-ctf
View on GitHub
Code used to run the platform for the LLM CTF colocated with SaTML 2024
☆29Mar 20, 2024Updated 2 years ago
smilelight / lightSpider
View on GitHub
lightsmile个人的用于爬取网络公开语料数据的mini通用爬虫框架。
☆13Sep 30, 2020Updated 5 years ago
yunan4nlp / NNDisParser
View on GitHub
☆10Aug 30, 2022Updated 3 years ago
Jellyfish042 / RWKV-15Puzzle
View on GitHub
☆12Dec 14, 2024Updated last year
lovodkin93 / attribute-first-then-generate
View on GitHub
Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024
☆30Dec 19, 2024Updated last year