gdamaskinos / unsupervised_topic_segmentationLinks
☆104Updated 4 years ago
Alternatives and similar repositories for unsupervised_topic_segmentation
Users that are interested in unsupervised_topic_segmentation are comparing it to the libraries listed below
Sorting:
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 5 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆180Updated 10 months ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆76Updated 4 years ago
- A collection of task-specific NLU datasets☆155Updated 3 years ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆141Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆95Updated 6 months ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 3 years ago
- (yet another not really) awesome topic/text segmentation list☆109Updated 6 years ago
- ☆78Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated last year
- Segment documents into coherent parts using word embeddings.☆149Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 4 years ago
- ☆87Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆145Updated 2 years ago
- Build a dialog dataset from online books in many languages☆76Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆104Updated 4 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated last week
- ☆68Updated 5 months ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆95Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 3 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆96Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- ☆39Updated 2 years ago
- ☆30Updated 4 years ago