timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for one-token-approximation
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Updated last year
- ☆46Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 4 years ago
- ☆24Updated last year
- ☆32Updated 3 years ago
- ☆25Updated 9 months ago
- ☆23Updated 4 years ago
- RelEx - A simple framework for Relation Extraction built on AllenNLP☆16Updated 4 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆12Updated 4 months ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction☆13Updated 4 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- Code and Data for Evaluation WG☆41Updated 2 years ago
- Tool to perform paired evaluation of automatic systems☆12Updated 3 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆30Updated 4 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- ☆30Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆82Updated last month
- The NLPStatTest project☆11Updated 2 years ago
- Entity Evaluation code☆21Updated 5 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- ☆20Updated 4 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Pretraining scripts for BART transformer model☆11Updated last year
- Adaptive Passage Encoder for Open-domain Question Answering☆15Updated 3 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago