unicamp-dl / Lite-T5-TranslationLinks
☆28Updated 2 years ago
Alternatives and similar repositories for Lite-T5-Translation
Users that are interested in Lite-T5-Translation are comparing it to the libraries listed below
Sorting:
- Code for training and evaluating T5 on Portuguese data.☆90Updated 3 years ago
- Portuguese translation of the GLUE benchmark and Scitail dataset☆32Updated 3 years ago
- Evaluation and baseline scripts for the ASSIN shared task.☆11Updated 6 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 3 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Updated 4 years ago
- ☆11Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Updated 10 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Updated 4 years ago
- A software for transferring pre-trained English models to foreign languages☆19Updated 2 years ago
- ☆184Updated 2 years ago
- ☆16Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset☆146Updated 2 years ago
- ☆12Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆29Updated 4 years ago
- PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment☆13Updated 6 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- ☆60Updated 3 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Updated last year
- A python true casing utility that restores case information for texts☆88Updated 3 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆191Updated 3 years ago
- ☆68Updated 8 months ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 8 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Pretraining scripts for BART transformer model☆12Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Updated 4 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Updated last year
- Long-context pretrained encoder-decoder models☆96Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22Updated 2 years ago