A research project for natural language generation, containing the official implementations by the MSRA NLC team.
☆744 · Jul 25, 2024 · Updated last year
Alternatives and similar repositories for ProphetNet
Users who are interested in ProphetNet are comparing it to the repositories listed below.
- Diffusion-LM ☆1,229 · Aug 8, 2024 · Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆433 · Aug 17, 2022 · Updated 3 years ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models ☆830 · Mar 1, 2024 · Updated 2 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark" ☆57 · Oct 26, 2022 · Updated 3 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆99 · Aug 17, 2023 · Updated 2 years ago
- ☆70 · Jun 16, 2022 · Updated 3 years ago
- Code for ACL 2020 paper: "Extractive Summarization as Text Matching" ☆522 · Jan 11, 2022 · Updated 4 years ago
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team. ☆116 · Jan 9, 2024 · Updated 2 years ago
- ☆1,651 · Jul 20, 2023 · Updated 2 years ago
- Longformer: The Long-Document Transformer ☆2,189 · Feb 8, 2023 · Updated 3 years ago
- ☆13 · Jul 20, 2021 · Updated 4 years ago
- ☆25 · Oct 20, 2022 · Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆22,046 · Jan 23, 2026 · Updated last month
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,494 · Jan 14, 2026 · Updated 2 months ago
- Code for EMNLP2020 paper: "Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space" ☆26 · May 10, 2021 · Updated 4 years ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders ☆1,303 · Jul 25, 2024 · Updated last year
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation". ☆129 · Jun 30, 2021 · Updated 4 years ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models ☆339 · Feb 17, 2024 · Updated 2 years ago
- GSum: A General Framework for Guided Neural Abstractive Summarization ☆116 · Sep 22, 2025 · Updated 5 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,191 · Sep 30, 2025 · Updated 5 months ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆788 · Aug 4, 2023 · Updated 2 years ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf ☆297 · Sep 11, 2021 · Updated 4 years ago
- Code for EMNLP2020 paper: "Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation" ☆20 · Dec 3, 2020 · Updated 5 years ago
- ☆40 · Jun 2, 2021 · Updated 4 years ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task. ☆1,863 · Apr 6, 2023 · Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,759 · Mar 12, 2026 · Updated last week
- MASS: Masked Sequence to Sequence Pre-training for Language Generation ☆1,122 · Nov 28, 2022 · Updated 3 years ago
- The implementation of DeBERTa ☆2,201 · Sep 29, 2023 · Updated 2 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,643 · Oct 16, 2024 · Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,371 · Mar 23, 2024 · Updated last year
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆651 · Jan 4, 2023 · Updated 3 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,626 · Jun 12, 2023 · Updated 2 years ago
- BERT score for text generation ☆1,880 · Jul 30, 2024 · Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 4 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a… ☆246 · Sep 17, 2021 · Updated 4 years ago
- Library for Knowledge Intensive Language Tasks ☆970 · Mar 31, 2022 · Updated 3 years ago
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey". ☆60 · May 24, 2024 · Updated last year
- Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20) ☆166 · Apr 25, 2020 · Updated 5 years ago
- ☆150 · Feb 27, 2024 · Updated 2 years ago