☆537Feb 13, 2024Updated 2 years ago
Alternatives and similar repositories for byt5
Users that are interested in byt5 are comparing it to the libraries listed below
- ☆1,297Dec 15, 2022Updated 3 years ago
- ☆2,950Jan 15, 2026Updated last month
- ☆184May 26, 2023Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- Library for Knowledge Intensive Language Tasks☆967Mar 31, 2022Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆606Jun 15, 2022Updated 3 years ago
- OSLO: Open Source framework for Large-scale model Optimization☆309Aug 25, 2022Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,489Jan 14, 2026Updated last month
- The implementation of DeBERTa☆2,197Sep 29, 2023Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Apr 24, 2023Updated 2 years ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆612Nov 21, 2022Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆786May 19, 2024Updated last year
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Jan 22, 2022Updated 4 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them"☆209Aug 31, 2021Updated 4 years ago
- Autoregressive Entity Retrieval☆797Jul 6, 2023Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,752Feb 20, 2026Updated 2 weeks ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Jun 12, 2023Updated 2 years ago
- ☆1,560Feb 20, 2026Updated 2 weeks ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆589Apr 24, 2023Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Sep 26, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆119Aug 3, 2021Updated 4 years ago
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆465Nov 5, 2022Updated 3 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆594Feb 3, 2026Updated last month
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆149May 1, 2025Updated 10 months ago
- Meetup every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆781Dec 16, 2023Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Mar 23, 2024Updated last year