LEYADEV / Vocabulary-Transfer
Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf
☆20Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Vocabulary-Transfer
- ☆73Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Automatic metrics for GEM tasks☆61Updated 2 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆19Updated 2 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆36Updated last year
- Implementation of the paper 'Plug and Play Autoencoders for Conditional Text Generation'☆42Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆30Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Updated 2 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 4 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆25Updated last year
- ☆70Updated 3 years ago
- ☆32Updated 3 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Updated 3 years ago
- Codebase for probing and visualizing multilingual models.☆45Updated 4 years ago
- ☆38Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆65Updated 3 years ago
- Code and Data for Evaluation WG☆41Updated 2 years ago
- Hyperparameter Search for AllenNLP☆134Updated 4 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆42Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 2 years ago
- ☆42Updated 3 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated last year
- codebase for the Text-based NP Enrichment (TNE) paper☆19Updated 8 months ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆79Updated 3 years ago