MorenoLaQuatra / bart-it
Pre-training BART model for the Italian Language
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for bart-it
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆70Updated 3 months ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆36Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆46Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆23Updated last month
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆13Updated 7 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆53Updated 3 months ago
- ☆10Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆28Updated 5 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆16Updated 9 months ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆53Updated 5 months ago
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆12Updated 11 months ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 7 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆26Updated 10 months ago
- This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Ret…☆70Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆14Updated last year
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆13Updated 9 months ago
- mSimCSE: Multilingual SimCSE☆33Updated 2 years ago
- Efficient Memory-Augmented Transformers☆34Updated last year
- ☆14Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆31Updated last year
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆76Updated 2 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆11Updated 2 years ago
- ☆29Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆72Updated last year