timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model that uses subword-level tokenization.
☆11 · Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users who are interested in one-token-approximation compare it to the repositories listed below.
- ☆24 · Updated last year
- ☆46 · Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 · Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆27 · Updated 3 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 · Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Updated 3 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 · Updated last year
- Fine-tuned Transformers compatible BERT models for Sequence Tagging ☆40 · Updated 4 years ago
- ☆37 · Updated 3 years ago
- ☆31 · Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 · Updated 4 years ago
- Statistics on multilingual datasets ☆17 · Updated 2 years ago
- RelEx - A simple framework for Relation Extraction built on AllenNLP ☆16 · Updated 4 years ago
- ☆68 · Updated 2 weeks ago
- ☆20 · Updated 4 years ago
- ☆25 · Updated last year
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers" ☆59 · Updated last year
- ☆24 · Updated 5 years ago
- Official code for the paper "PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models". ☆16 · Updated 2 years ago
- ☆13 · Updated 4 years ago
- ☆76 · Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆31 · Updated 3 years ago
- ☆11 · Updated 4 years ago
- ☆40 · Updated 4 years ago
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali… ☆14 · Updated 5 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆66 · Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆94 · Updated 2 years ago
- ACL'2020: Contextualized Sparse Representations for Real-Time Open-Domain Question Answering ☆49 · Updated 4 years ago
- Repro is a library for easily running code from published papers via Docker. ☆40 · Updated last year
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Updated 4 years ago