tatHi / maxmatch_dropout
☆10Updated 2 years ago
Alternatives and similar repositories for maxmatch_dropout
Users that are interested in maxmatch_dropout are comparing it to the libraries listed below
Sorting:
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆16Updated last year
- ☆21Updated last year
- ☆11Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆25Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- ☆25Updated 2 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆20Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Updated 3 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆27Updated 3 years ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization☆15Updated last year
- TBC☆27Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Script to pre-train hugginface transformers BART with Tensorflow 2☆33Updated 2 years ago
- ☆24Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- ☆30Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆41Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆21Updated 8 months ago
- Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling☆9Updated 2 years ago
- ☆41Updated 4 years ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆11Updated last year
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Updated 2 years ago
- Code for the paper "Attention Temperature Matters in Abstractive Summarization Distillation"(https://arxiv.org/abs/2106.03441)☆13Updated 3 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Updated 3 years ago
- Benchmark API for Multidomain Language Modeling☆24Updated 2 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- ☆44Updated 3 years ago