microsoft / Lightweight-Low-Resource-NMT
Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models" to appear in WMT 2022.
☆17Updated last year
Alternatives and similar repositories for Lightweight-Low-Resource-NMT:
Users that are interested in Lightweight-Low-Resource-NMT are comparing it to the libraries listed below
- CyBERTron-LM is a project which collects some pre-trained Transformer-based models.☆12Updated last year
- We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…☆38Updated 2 years ago
- Fault-aware neural code rankers☆28Updated 2 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆60Updated last year
- ☆84Updated last year
- ☆22Updated last year
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization☆30Updated 2 years ago
- ☆14Updated last year
- A self-supervised learning approach based on extremely large masking☆30Updated 2 years ago
- Diffusion-based markup-to-image generation☆81Updated 2 years ago
- ☆27Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆100Updated last year
- ☆18Updated last month
- ☆44Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- An instruction-based benchmark for text improvements.☆141Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆83Updated 10 months ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆272Updated 3 months ago
- ☆19Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Building modular LMs with parameter-efficient fine-tuning.☆103Updated last week
- ☆13Updated this week
- Evaluation pipeline for the BabyLM Challenge 2023.☆75Updated last year
- ☆51Updated 2 years ago
- ☆97Updated 2 years ago
- ☆65Updated last year
- Index of URLs to pdf files all over the internet and scripts☆23Updated last year
- ☆77Updated last year
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year