A research project for natural language generation, containing the official implementations by MSRA NLC team.
☆742Jul 25, 2024Updated last year
Alternatives and similar repositories for ProphetNet
Users that are interested in ProphetNet are comparing it to the libraries listed below
Sorting:
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- Diffusion-LM☆1,224Aug 8, 2024Updated last year
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆828Mar 1, 2024Updated last year
- Code for ACL 2020 paper: "Extractive Summarization as Text Matching"☆522Jan 11, 2022Updated 4 years ago
- Longformer: The Long-Document Transformer☆2,188Feb 8, 2023Updated 3 years ago
- ☆1,651Jul 20, 2023Updated 2 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,490Jan 14, 2026Updated last month
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆99Aug 17, 2023Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,033Jan 23, 2026Updated last month
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,304Jul 25, 2024Updated last year
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆129Jun 30, 2021Updated 4 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆57Oct 26, 2022Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆786Aug 4, 2023Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,159Sep 30, 2025Updated 5 months ago
- The implementation of DeBERTa☆2,194Sep 29, 2023Updated 2 years ago
- GSum: A General Framework for Guided Neural Abstractive Summarization☆116Sep 22, 2025Updated 5 months ago
- ☆40Jun 2, 2021Updated 4 years ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,860Apr 6, 2023Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,749Feb 20, 2026Updated last week
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆650Jan 4, 2023Updated 3 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,641Oct 16, 2024Updated last year
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Jun 12, 2023Updated 2 years ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆298Sep 11, 2021Updated 4 years ago
- ☆70Jun 16, 2022Updated 3 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,123Nov 28, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Mar 23, 2024Updated last year
- Entity Linker solution☆1,206Sep 21, 2023Updated 2 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆246Sep 17, 2021Updated 4 years ago
- Library for Knowledge Intensive Language Tasks☆965Mar 31, 2022Updated 3 years ago
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.☆116Jan 9, 2024Updated 2 years ago
- Code for using and evaluating SpanBERT.☆904Jul 25, 2023Updated 2 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆345Nov 11, 2024Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,155Feb 20, 2024Updated 2 years ago
- BERT score for text generation☆1,873Jul 30, 2024Updated last year
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆339Feb 17, 2024Updated 2 years ago
- Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)☆165Apr 25, 2020Updated 5 years ago
- State-of-the-Art Text Embeddings☆18,298Feb 20, 2026Updated last week
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year