An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
☆435Aug 17, 2022Updated 3 years ago
Alternatives and similar repositories for fastseq
Users that are interested in fastseq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FastFormers - highly efficient transformer models for NLU☆708Mar 21, 2025Updated last year
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,301May 16, 2023Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆57Oct 26, 2022Updated 3 years ago
- Autoregressive Entity Retrieval☆797Jul 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆588Apr 24, 2023Updated 3 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Apr 26, 2021Updated 5 years ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆745Jul 25, 2024Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆193Jun 14, 2023Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- ☆220Jun 8, 2020Updated 5 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆150May 1, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,651Oct 16, 2024Updated last year
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆253Nov 8, 2021Updated 4 years ago
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Feb 6, 2022Updated 4 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 4 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆728Aug 29, 2022Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆606Jun 15, 2022Updated 3 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆302Mar 15, 2023Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆793Aug 4, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆440May 26, 2025Updated 11 months ago
- Library for Knowledge Intensive Language Tasks☆973Mar 31, 2022Updated 4 years ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆146Oct 5, 2022Updated 3 years ago
- SummVis is an interactive visualization tool for text summarization.☆253Jun 17, 2022Updated 3 years ago
- FactSumm: Factual Consistency Scorer for Abstractive Summarization☆113Jan 1, 2024Updated 2 years ago
- ☆113Sep 26, 2021Updated 4 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,626Jun 12, 2023Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆788May 19, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast Block Sparse Matrices for Pytorch☆551Jan 21, 2021Updated 5 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- ☆604Mar 12, 2026Updated 2 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,686Oct 23, 2024Updated last year
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆42May 5, 2021Updated 5 years ago
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆168Oct 7, 2022Updated 3 years ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆611Nov 21, 2022Updated 3 years ago