maroxtn / mt5-M2M-comparison
Comparing M2M and mT5 on a rare language pair. Blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-resources-translation-english-yoruba-ef56624d2b75
☆15Updated 3 years ago
Alternatives and similar repositories for mt5-M2M-comparison:
Users who are interested in mt5-M2M-comparison are comparing it to the libraries listed below:
- MAFAND-MT☆55Updated 6 months ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 3 years ago
- ☆51Updated last year
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+☆37Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- Crosslingual Question Answering for African Languages☆29Updated 3 months ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch …☆80Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆102Updated 2 years ago
- Using short models to classify long texts☆21Updated last year
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- BERT, RoBERTa fine-tuning over SQuAD Dataset using pytorch-lightning⚡️, 🤗-transformers & 🤗-nlp.☆36Updated last year
- ☆20Updated 3 years ago
- ☆11Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- Hinglish Text Classification☆30Updated last year
- Helper scripts and notes that were used while porting various nlp models☆45Updated 2 years ago
- Fine-tuned BERT on the SQuAD 2.0 dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m…☆25Updated 3 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated last year
- a large scientific paraphrase dataset for longer paraphrase generation☆38Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- ☆30Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 8 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆18Updated 8 months ago
- Consists of the largest (10K) human-annotated code-switched semantic parsing dataset & 170K generated utterances using the CST5 augmentati…☆35Updated last year
- Awesome Question Answering☆28Updated 2 years ago