MorenoLaQuatra / bart-itLinks

Pre-training BART model for the Italian Language

☆16

Alternatives and similar repositories for bart-it

Users that are interested in bart-it are comparing it to the libraries listed below

Sorting:

Helsinki-NLP / OPUS-MT-testsets
benchmarks for evaluating MT models
☆12Updated last year
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆55Updated this week
qqaatw / pytorch-realm-orqa
PyTorch reimplementation of REALM and ORQA
☆22Updated 3 years ago
Rojak-NLP / LLM-Code-Mixing
Can LLMs generate code-mixed sentences through zero-shot prompting?
☆11Updated 2 years ago
hlt-mt / FBK-fairseq
Repository containing the open source code of works published at the FBK MT unit.
☆47Updated last week
bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆73Updated last year
amazon-science / masked-diffusion-lm
Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"
☆58Updated last year
Silin159 / ComFact
☆17Updated 3 months ago
mojave-pku / UniPrompt
☆10Updated 2 years ago
liuzeming01 / XDailyDialog
https://liuzeming01.github.io/XDailyDialog/
☆10Updated 2 years ago
szhang42 / Calibration_qa
☆10Updated 3 years ago
cooelf / CompassMTL
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Updated 2 years ago
lgessler / microbert
A tiny BERT for low-resource monolingual models
☆31Updated 9 months ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆69Updated 2 years ago
BinWang28 / EvalRank-Embedding-Evaluation
ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities
☆35Updated 2 years ago
ahmetustun / hyperx
☆20Updated 2 years ago
microsoft / CodeMixed-Text-Generator
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…
☆55Updated 11 months ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆30Updated 3 years ago
lu-wo / whisbert
babyLM WhisBERT code
☆20Updated last year
ErikEkstedt / datasets_turntaking
Datasets for turn-taking research
☆15Updated last year
MANGA-UOFA / Prompt-Edit
An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"
☆12Updated last year
Observeai-Research / Phoneme-BERT
☆34Updated 4 years ago
ictnlp / Seq-NAT
Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.
☆24Updated 3 years ago
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
tylerachang / word-acquisition-language-models
Word acquisition in neural language models (TACL 2022).
☆16Updated 5 months ago
uds-lsv / MCSE
NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings
☆55Updated last year
jungokasai / twist_decoding
☆29Updated 3 years ago
formiel / speech-translation
Multilingual speech translation
☆41Updated 4 years ago
asahi417 / relbert
The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…
☆47Updated 7 months ago
babylm / evaluation-pipeline-2023
Evaluation pipeline for the BabyLM Challenge 2023.
☆76Updated last year