timoschick / one-token-approximationLinks
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users that are interested in one-token-approximation are comparing it to the libraries listed below
Sorting:
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 5 years ago
- ☆46Updated 5 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆95Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- Hyperparameter Search for AllenNLP☆139Updated 6 months ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆38Updated 2 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆87Updated 3 months ago
- Codebase for probing and visualizing multilingual models.☆49Updated 5 years ago
- Tool to perform paired evaluation of automatic systems☆12Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆145Updated 2 years ago
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 5 years ago
- Pretraining scripts for BART transformer model☆12Updated 2 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- ☆32Updated 4 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆25Updated 4 years ago
- The NLPStatTest project☆12Updated 3 years ago
- ☆13Updated 4 years ago
- ☆87Updated 3 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- ☆75Updated 4 years ago
- ☆22Updated 3 years ago
- ☆21Updated 4 years ago
- ☆25Updated 5 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago