pytorch-tpu / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆13 · Updated this week
Related projects
Alternatives and complementary repositories for transformers
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining ☆12 · Updated 11 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆92 · Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Updated 2 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). ☆43 · Updated last year
- Data-related codebase for the polyglot project ☆19 · Updated last year
- ☆23 · Updated last year
- Pre-training BART in Flax on The Pile dataset ☆20 · Updated 3 years ago
- ☆42 · Updated 4 years ago
- PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆63 · Updated 2 years ago
- Calculating the expected time for training an LLM. ☆38 · Updated last year
- PyTorch reimplementation of REALM and ORQA ☆22 · Updated 2 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering ☆36 · Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆72 · Updated 2 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 ☆21 · Updated 3 years ago
- Transformers at any scale ☆41 · Updated 9 months ago
- ☆95 · Updated last year
- ☆20 · Updated 3 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆31 · Updated last year
- ☆26 · Updated 7 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 · Updated 3 years ago
- ☆14 · Updated 8 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆97 · Updated last year
- DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022) ☆50 · Updated last year
- ☆55 · Updated last year
- ☆12 · Updated 10 months ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization" ☆16 · Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 3 months ago
- ☆19 · Updated 2 years ago
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a… ☆46 · Updated 2 years ago