FastFormers - highly efficient transformer models for NLU
☆708Mar 21, 2025Updated last year
Alternatives and similar repositories for fastformers
Users that are interested in fastformers are comparing it to the libraries listed below.
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Prune a model while finetuning or training.☆406Jun 21, 2022Updated 3 years ago
- ☆220Jun 8, 2020Updated 5 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆435Aug 17, 2022Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,548Jul 18, 2025Updated 10 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 4 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,754Dec 20, 2023Updated 2 years ago
- Library for Knowledge Intensive Language Tasks☆973Mar 31, 2022Updated 4 years ago
- Longformer: The Long-Document Transformer☆2,195Feb 8, 2023Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,370Mar 23, 2024Updated 2 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,052Oct 31, 2020Updated 5 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,051Jan 9, 2024Updated 2 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,931Feb 14, 2023Updated 3 years ago
- PyTorch library for fast transformer implementations☆1,771Mar 23, 2023Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,686Oct 23, 2024Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,652Apr 15, 2026Updated last month
- Robustness Gym is an evaluation toolkit for machine learning.☆447Jun 28, 2022Updated 3 years ago
- Fast Block Sparse Matrices for PyTorch☆551Jan 21, 2021Updated 5 years ago
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆610Jul 11, 2024Updated last year
- Super easy library for BERT based NLP models☆1,919Aug 19, 2024Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,516Jan 14, 2026Updated 4 months ago
- A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)☆5,629Apr 7, 2026Updated last month
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,265Mar 2, 2023Updated 3 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,175May 28, 2023Updated 2 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,162Jan 22, 2024Updated 2 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,626Jun 12, 2023Updated 2 years ago
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆257Nov 2, 2022Updated 3 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆795Jul 22, 2025Updated 9 months ago
- PyTorch extensions for high performance and large scale training.☆3,406Apr 26, 2025Updated last year
- Transformer training code for sequential tasks☆610Sep 14, 2021Updated 4 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,417Apr 17, 2026Updated last month
- Data augmentation for NLP☆4,657Jun 24, 2024Updated last year
- Reformer, the efficient Transformer, in PyTorch☆2,190Jun 21, 2023Updated 2 years ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"☆191Mar 8, 2021Updated 5 years ago
- Unofficial PyTorch implementation of Fastformer based on the paper "Fastformer: Additive Attention Can Be All You Need"☆131Sep 6, 2021Updated 4 years ago