MorenoLaQuatra / bart-it
Pre-training BART model for the Italian Language
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for bart-it
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆12Updated 11 months ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆30Updated 2 weeks ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆12Updated 6 months ago
- Measuring the Mixing of Contextual Information in the Transformer☆25Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆46Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆55Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆70Updated 3 months ago
- A library for minimum Bayes risk (MBR) decoding☆29Updated 2 weeks ago
- ☆19Updated last year
- ☆11Updated 2 years ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆72Updated last year
- ☆14Updated 11 months ago
- ☆10Updated 2 years ago
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆75Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆23Updated last month
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 6 months ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆46Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"☆18Updated last year
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆14Updated last year
- ☆14Updated 3 weeks ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆32Updated 9 months ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆18Updated 11 months ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Updated last year
- Interpretable unified language safety checking with large language models☆30Updated last year
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆13Updated last year