timoschick / one-token-approximationLinks
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users that are interested in one-token-approximation are comparing it to the libraries listed below
Sorting:
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 4 years ago
 - Codebase for probing and visualizing multilingual models.☆49Updated 5 years ago
 - ☆13Updated 4 years ago
 - ☆46Updated 5 years ago
 - EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
 - SUPERT: Unsupervised multi-document summarization evaluation & generation☆95Updated 2 years ago
 - ☆32Updated 4 years ago
 - Analyzing mBERT's multilinguality in a small laboratory setting☆13Updated 2 years ago
 - This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
 - Tool to perform paired evaluation of automatic systems☆12Updated 4 years ago
 - Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 5 years ago
 - PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆25Updated 4 years ago
 - Pretraining scripts for BART transformer model☆12Updated 2 years ago
 - SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆146Updated 3 years ago
 - ☆68Updated 6 months ago
 - ☆25Updated last year
 - We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 5 years ago
 - REALSumm: Re-evaluating Evaluation in Text Summarization☆72Updated last month
 - ☆92Updated 4 years ago
 - Hyperparameter Search for AllenNLP☆140Updated 7 months ago
 - This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Updated 4 years ago
 - The NLPStatTest project☆12Updated 3 years ago
 - GMEG☆31Updated 11 months ago
 - ☆25Updated 5 years ago
 - A framework for evaluating Machine Translation models.☆11Updated 5 months ago
 - 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated 2 years ago
 - ☆75Updated 4 years ago
 - Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆88Updated 5 months ago
 - Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Updated 5 years ago
 - Codebase, data and models for the Keep it Simple paper at ACL2021☆39Updated 2 years ago