timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model that uses subword-level tokenization.
☆11 · Updated 4 years ago
Alternatives and similar repositories for one-token-approximation:
Users who are interested in one-token-approximation compare it to the libraries listed below.
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 · Updated 3 years ago
- ☆24 · Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Updated 3 years ago
- ☆46 · Updated 5 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 · Updated 2 years ago
- The NLPStatTest project ☆12 · Updated 2 years ago
- Pretraining scripts for BART transformer model ☆11 · Updated last year
- ☆16 · Updated 3 years ago
- ☆24 · Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 · Updated 4 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization" ☆24 · Updated 3 years ago
- End-to-end shallow discourse parser ☆20 · Updated last year
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise" ☆21 · Updated 5 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 · Updated 2 years ago
- EMNLP DiscoEval paper ☆42 · Updated 5 years ago
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali… ☆14 · Updated 5 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 · Updated 4 years ago
- Tool to perform paired evaluation of automatic systems ☆12 · Updated 3 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding. ☆50 · Updated 3 years ago
- Code and Data for our EMNLP 2020 paper titled 'Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multiho… ☆28 · Updated 3 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 · Updated last year
- ☆25 · Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Updated 3 years ago
- ☆15 · Updated 3 years ago
- Entity Evaluation code ☆21 · Updated 5 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) ☆14 · Updated 7 months ago
- ☆37 · Updated 3 years ago
- A software for transferring pre-trained English models to foreign languages ☆18 · Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆30 · Updated 2 years ago
- Automatically detect errors in annotated corpora. ☆47 · Updated last year