maroxtn / mt5-M2M-comparison
Comparing M2M and mT5 on a rare language pair. Blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-resources-translation-english-yoruba-ef56624d2b75
☆16 Updated 4 years ago
Alternatives and similar repositories for mt5-M2M-comparison
Users who are interested in mt5-M2M-comparison are comparing it to the libraries listed below.
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0. ☆105 Updated 3 years ago
- Some notebooks for NLP ☆207 Updated last year
- MAFAND-MT ☆57 Updated last year
- Fine-tuning GPT-2 Small for Question Answering ☆130 Updated 2 years ago
- Comprehensive NLP Evaluation System ☆188 Updated last year
- An example of multilingual machine translation using a pretrained version of mT5 from Hugging Face. ☆42 Updated 4 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆43 Updated 10 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆93 Updated 2 years ago
- Crosslingual Question Answering for African Languages ☆31 Updated 11 months ago
- ☆183 Updated 2 years ago
- Efficient Attention for Long Sequence Processing ☆98 Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+ ☆37 Updated 4 years ago
- PyTorch implementation of NMT models along with custom tokenizers, models, and datasets ☆20 Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆47 Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆63 Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro… ☆161 Updated 11 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆75 Updated 3 years ago
- An instruction-based benchmark for text improvements. ☆141 Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆155 Updated last year
- Generating boolean (yes/no) questions from any content using T5 text-to-text transformer model and BoolQ dataset ☆35 Updated 2 years ago
- Abstractive and Extractive Text summarization using Transformers. ☆85 Updated 2 years ago
- A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training. ☆156 Updated 2 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch … ☆82 Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model ☆94 Updated 2 years ago
- Fine-tuned BERT on SQuAD 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m… ☆25 Updated 4 years ago
- ☆57 Updated 3 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases. ☆30 Updated 4 years ago
- ☆44 Updated 4 years ago