timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11 Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users who are interested in one-token-approximation are comparing it to the libraries listed below.
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆88 Updated 4 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆27 Updated 4 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆64 Updated 5 years ago
- ☆46 Updated 5 years ago
- ☆75 Updated 4 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 Updated 2 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging ☆40 Updated 5 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆66 Updated 4 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition" ☆19 Updated 4 years ago
- Hyperparameter Search for AllenNLP ☆139 Updated 7 months ago
- Codebase for probing and visualizing multilingual models. ☆49 Updated 5 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆95 Updated 2 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering". ☆80 Updated 4 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings ☆58 Updated 2 years ago
- ☆25 Updated last year
- ☆68 Updated 5 months ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization" ☆25 Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆145 Updated 2 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 Updated 5 years ago
- ☆25 Updated 5 years ago
- ☆13 Updated 4 years ago
- ☆50 Updated 3 years ago
- ☆32 Updated 4 years ago
- The NLPStatTest project ☆12 Updated 3 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆38 Updated 3 years ago
- Massively Multilingual Transfer for NER ☆86 Updated 4 years ago
- Tool to perform paired evaluation of automatic systems ☆12 Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 Updated 4 years ago
- Code to reproduce the experiments from the paper. ☆101 Updated 2 years ago