A research project for natural language generation, containing the official implementations by MSRA NLC team.
☆745Jul 25, 2024Updated last year
Alternatives and similar repositories for ProphetNet
Users that are interested in ProphetNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Diffusion-LM☆1,242Aug 8, 2024Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆435Aug 17, 2022Updated 3 years ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆837Mar 1, 2024Updated 2 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆57Oct 26, 2022Updated 3 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆99Aug 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆70Jun 16, 2022Updated 3 years ago
- Code for ACL 2020 paper: "Extractive Summarization as Text Matching"☆522Jan 11, 2022Updated 4 years ago
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.☆116Jan 9, 2024Updated 2 years ago
- ☆1,655Jul 20, 2023Updated 2 years ago
- Longformer: The Long-Document Transformer☆2,194Feb 8, 2023Updated 3 years ago
- ☆13Jul 20, 2021Updated 4 years ago
- ☆25Oct 20, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,128Jan 23, 2026Updated 3 months ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,516Jan 14, 2026Updated 4 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for EMNLP2020 paper: "Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space"☆26May 10, 2021Updated 5 years ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,303Jul 25, 2024Updated last year
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆129Jun 30, 2021Updated 4 years ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆342Feb 17, 2024Updated 2 years ago
- GSum: A General Framework for Guided Neural Abstractive Summarization☆115Sep 22, 2025Updated 7 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,222Sep 30, 2025Updated 7 months ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆793Aug 4, 2023Updated 2 years ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆299Sep 11, 2021Updated 4 years ago
- Code for EMNLP2020 paper: "Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation"☆20Dec 3, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆40Jun 2, 2021Updated 4 years ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,865Apr 6, 2023Updated 3 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,775May 11, 2026Updated last week
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,120Nov 28, 2022Updated 3 years ago
- The implementation of DeBERTa☆2,219Sep 29, 2023Updated 2 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,651Oct 16, 2024Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,370Mar 23, 2024Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆652Jan 4, 2023Updated 3 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,626Jun 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- BERT score for text generation☆1,898Jul 30, 2024Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆246Sep 17, 2021Updated 4 years ago
- Library for Knowledge Intensive Language Tasks☆973Mar 31, 2022Updated 4 years ago
- Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)☆166Apr 25, 2020Updated 6 years ago
- ☆150Feb 27, 2024Updated 2 years ago
- ☆1,295Dec 15, 2022Updated 3 years ago