maroxtn / mt5-M2M-comparison
Comparing M2M and mT5 on a rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-resources-translation-english-yoruba-ef56624d2b75
☆15Updated 3 years ago
Alternatives and similar repositories for mt5-M2M-comparison:
Users that are interested in mt5-M2M-comparison are comparing it to the libraries listed below
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- ☆14Updated 5 months ago
- Awesome Question Answering☆28Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆68Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Crosslingual Question Answering for African Languages☆29Updated 6 months ago
- Abstractive and Extractive Text summarization using Transformers.☆83Updated last year
- Use Google's state-of-the-art T5 pre-train model to create human-like summarization☆25Updated 3 years ago
- ☆38Updated 2 years ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- This project aims at creating a search engine based on BERT language model.☆20Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆102Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- ☆11Updated 2 years ago
- ☆19Updated 2 years ago
- ☆51Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆81Updated 2 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated last month
- This repository contains code for the paper "Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a…☆13Updated 2 years ago
- Some notebooks for NLP☆200Updated last year
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 8 months ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆16Updated last year
- ☆15Updated last year
- Developing tools to automatically analyze datasets☆74Updated 5 months ago
- MAFAND-MT☆55Updated 9 months ago