timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11 Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users who are interested in one-token-approximation are comparing it to the repositories listed below.
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆64 Updated 5 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging ☆40 Updated 5 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆27 Updated 4 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆96 Updated 3 years ago
- ☆32 Updated 4 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 Updated 2 years ago
- Hyperparameter Search for AllenNLP ☆140 Updated 10 months ago
- ☆13 Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 4 years ago
- ☆46 Updated 5 years ago
- Codebase for probing and visualizing multilingual models. ☆49 Updated 5 years ago
- ☆21 Updated 5 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆90 Updated 7 months ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization" ☆26 Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 Updated 5 years ago
- ☆25 Updated last year
- The NLPStatTest project ☆12 Updated 3 years ago
- ☆68 Updated 8 months ago
- Tool to perform paired evaluation of automatic systems ☆13 Updated 4 years ago
- Code to reproduce the experiments from the paper. ☆103 Updated 2 years ago
- ☆88 Updated 4 years ago
- Pretraining scripts for BART transformer model ☆12 Updated 2 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆64 Updated 4 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering". ☆80 Updated 4 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition" ☆19 Updated 5 years ago
- ☆75 Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 Updated 3 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets ☆41 Updated 5 years ago
- State of the art Semantic Sentence Embeddings ☆100 Updated 3 years ago
- ☆29 Updated 6 years ago