gdamaskinos / unsupervised_topic_segmentationLinks
☆103Updated 4 years ago
Alternatives and similar repositories for unsupervised_topic_segmentation
Users that are interested in unsupervised_topic_segmentation are comparing it to the libraries listed below
Sorting:
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 3 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆141Updated 2 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆180Updated 8 months ago
- A collection of task-specific NLU datasets☆151Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 3 years ago
- (yet another not really) awesome topic/text segmentation list☆109Updated 6 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- ☆87Updated 3 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆76Updated 4 years ago
- Build a dialog dataset from online books in many languages☆76Updated 2 years ago
- ☆78Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 4 years ago
- BERTserini☆26Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Use BERT to Fill in the Blanks☆83Updated 3 years ago
- Copora for evaluating NLU Services/Platforms such as Dialogflow, LUIS, Watson, Rasa etc.☆112Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆94Updated 5 months ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 11 months ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆211Updated 4 years ago
- ☆30Updated 4 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- Segment documents into coherent parts using word embeddings.☆149Updated 3 years ago
- ☆30Updated 4 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆149Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆105Updated 4 years ago