⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
☆588Apr 24, 2023Updated 3 years ago
Alternatives and similar repositories for fastT5
Users that are interested in fastT5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆257Nov 2, 2022Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,689Oct 23, 2024Updated last year
- Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed☆33Dec 11, 2024Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆434Aug 17, 2022Updated 3 years ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆402May 19, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆1,294Dec 15, 2022Updated 3 years ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,426Jun 22, 2026Updated last week
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Dec 5, 2020Updated 5 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆786May 19, 2024Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 5 years ago
- The implementation of DeBERTa☆2,238Sep 29, 2023Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆925Sep 2, 2024Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,585Jan 28, 2026Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fast inference engine for Transformer models☆4,552Updated this week
- Neural question generation using transformers☆1,142Apr 5, 2024Updated 2 years ago
- Data augmentation for NLP☆4,660Jun 20, 2026Updated 2 weeks ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,625Jun 12, 2023Updated 3 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,101Aug 15, 2024Updated last year
- ☆2,973Jun 15, 2026Updated 2 weeks ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,528Jan 14, 2026Updated 5 months ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca…☆493Dec 12, 2023Updated 2 years ago
- ☆546Feb 13, 2024Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking☆18,853Jun 26, 2026Updated last week
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆108May 20, 2022Updated 4 years ago
- OSLO: Open Source framework for Large-scale model Optimization☆309Aug 25, 2022Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆163Sep 24, 2024Updated last year
- Prune a model while finetuning or training.☆408Jun 21, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Efficient few-shot learning with Sentence Transformers☆2,761May 26, 2026Updated last month
- A PyTorch-based model pruning toolkit for pre-trained language models☆390Aug 31, 2023Updated 2 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆713Jul 31, 2023Updated 2 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 5 years ago
- Question-answers, collected from Google☆132Jul 23, 2021Updated 4 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Jan 22, 2022Updated 4 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings☆726Nov 19, 2023Updated 2 years ago