tatHi / maxmatch_dropoutLinks
☆10Updated 3 years ago
Alternatives and similar repositories for maxmatch_dropout
Users that are interested in maxmatch_dropout are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 4 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Updated 2 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Updated 3 years ago
- ☆25Updated 3 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆45Updated 3 years ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Updated 6 months ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Updated 4 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Updated 4 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- Test code of Inverse cloze task for information retrieval☆33Updated 4 years ago
- ☆31Updated last year
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Updated 2 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 3 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆25Updated 6 months ago
- ☆21Updated 2 years ago
- ☆15Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated last year
- Long-context pretrained encoder-decoder models☆96Updated 3 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Updated 4 years ago
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆92Updated 3 years ago
- ☆22Updated 3 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Updated 3 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆28Updated 4 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 3 years ago
- TBC☆28Updated 3 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16Updated 3 years ago
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)☆14Updated last year
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Updated 4 years ago