This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11May 7, 2020Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users that are interested in one-token-approximation are comparing it to the libraries listed below
Sorting:
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- ☆12Mar 20, 2020Updated 5 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 3 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 2 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆17Jul 16, 2024Updated last year
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling☆46Sep 3, 2019Updated 6 years ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28May 18, 2022Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆31Dec 24, 2025Updated 2 months ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 6 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Jul 2, 2021Updated 4 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- ☆10Oct 2, 2024Updated last year
- Javafx Icons lib☆15Aug 14, 2025Updated 6 months ago
- Implementing expectimax, alpha-beta pruning, and minimax algorithms in a game of Pacman☆11Jan 17, 2014Updated 12 years ago
- 【python】利用百度语音识别API,百度语音合成API,图灵机器人API实现简单的对话机器人☆10Mar 13, 2021Updated 4 years ago
- A Pre-trained BERT on StackOverflow Corpus☆47Feb 27, 2021Updated 5 years ago
- ☆30Sep 27, 2021Updated 4 years ago
- decontamination☆25Dec 3, 2025Updated 2 months ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆10Apr 14, 2025Updated 10 months ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).☆11May 1, 2025Updated 10 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Core repository of the retico framework providing the basic functionality of incremental processing.☆11Feb 17, 2026Updated last week
- ☆13Dec 5, 2024Updated last year
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- The General Mediation Engine (GME) is a software framework for producing interactive narratives using narrative mediation.☆10Feb 4, 2018Updated 8 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- ☆11Nov 19, 2020Updated 5 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Feb 12, 2026Updated 2 weeks ago
- ☆13Nov 28, 2025Updated 3 months ago