gdamaskinos / unsupervised_topic_segmentationLinks
☆103Updated 4 years ago
Alternatives and similar repositories for unsupervised_topic_segmentation
Users that are interested in unsupervised_topic_segmentation are comparing it to the libraries listed below
Sorting:
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated last year
- A collection of task-specific NLU datasets☆149Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 2 months ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆140Updated 2 years ago
- (yet another not really) awesome topic/text segmentation list☆109Updated 6 years ago
- ☆77Updated last year
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆179Updated 7 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- ☆30Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 4 months ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆75Updated 3 years ago
- ☆87Updated 3 years ago
- ☆68Updated 2 months ago
- Build a dialog dataset from online books in many languages☆75Updated 2 years ago
- BERTserini☆26Updated 2 years ago
- ☆38Updated 2 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆71Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)☆53Updated 3 years ago
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring☆64Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Copora for evaluating NLU Services/Platforms such as Dialogflow, LUIS, Watson, Rasa etc.☆112Updated 3 years ago
- Segment documents into coherent parts using word embeddings.☆149Updated 3 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- A multilingual version of MS MARCO passage ranking dataset☆145Updated last year
- ☆40Updated 4 years ago