⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
☆588Apr 24, 2023Updated 3 years ago
Alternatives and similar repositories for fastT5
Users that are interested in fastT5 are comparing it to the libraries listed below.
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆257Nov 2, 2022Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- Convert BART models to ONNX with quantization. 3X reduction in size, and up to 3X boost in inference speed☆33Dec 11, 2024Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆435Aug 17, 2022Updated 3 years ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆401May 19, 2023Updated 3 years ago
- ☆1,294Dec 15, 2022Updated 3 years ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,416Updated this week
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Dec 5, 2020Updated 5 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆787May 19, 2024Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 5 years ago
- The implementation of DeBERTa☆2,228Sep 29, 2023Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆358Feb 22, 2022Updated 4 years ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,586Jan 28, 2026Updated 4 months ago
- Fast inference engine for Transformer models☆4,517Jun 7, 2026Updated last week
- Neural question generation using transformers☆1,144Apr 5, 2024Updated 2 years ago
- Data augmentation for NLP☆4,658Updated this week
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,625Jun 12, 2023Updated 3 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,103Aug 15, 2024Updated last year
- ☆2,967May 20, 2026Updated 3 weeks ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,522Jan 14, 2026Updated 5 months ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 4 years ago
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca…☆493Dec 12, 2023Updated 2 years ago
- ☆545Feb 13, 2024Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking☆18,805Updated this week
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆108May 20, 2022Updated 4 years ago
- OSLO: Open Source framework for Large-scale model Optimization☆309Aug 25, 2022Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆163Sep 24, 2024Updated last year
- Prune a model while finetuning or training.☆407Jun 21, 2022Updated 3 years ago
- Efficient few-shot learning with Sentence Transformers☆2,746May 26, 2026Updated 2 weeks ago
- A PyTorch-based model pruning toolkit for pre-trained language models☆390Aug 31, 2023Updated 2 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆712Jul 31, 2023Updated 2 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 5 years ago
- Question-answers, collected from Google☆132Jul 23, 2021Updated 4 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Jan 22, 2022Updated 4 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings☆726Nov 19, 2023Updated 2 years ago