This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆11May 7, 2020Updated 5 years ago
Alternatives and similar repositories for one-token-approximation
Users that are interested in one-token-approximation are comparing it to the libraries listed below
Sorting:
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- ☆12Mar 20, 2020Updated 6 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 4 years ago
- Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling☆46Sep 3, 2019Updated 6 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- Core repository of the retico framework providing the basic functionality of incremental processing.☆11Mar 12, 2026Updated last week
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆17Jul 16, 2024Updated last year
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Aggregation for complex labels, described in https://www.ischool.utexas.edu/~ml/papers/braylan_web2020.pdf☆16Jun 7, 2024Updated last year
- ☆20Nov 24, 2019Updated 6 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Nov 16, 2022Updated 3 years ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28May 18, 2022Updated 3 years ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 6 years ago
- A Pre-trained BERT on StackOverflow Corpus☆47Feb 27, 2021Updated 5 years ago
- Codes for the paper "Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation"☆14Nov 24, 2022Updated 3 years ago
- Code for "Deep Energy-Based Modeling of Discrete-Time Physics," NeurIPS, 2020. (Oral)☆19Jan 30, 2022Updated 4 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Jul 2, 2021Updated 4 years ago
- ☆30Sep 27, 2021Updated 4 years ago
- Implementations of polygamma, lgamma, and beta functions for PyTorch☆24Jul 8, 2017Updated 8 years ago
- Re-Implementation of SPARTA model☆13Oct 1, 2021Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Dec 24, 2025Updated 2 months ago
- decontamination☆27Mar 4, 2026Updated 2 weeks ago
- ☆13Dec 5, 2024Updated last year
- code for "Self-supervised edge features for improved Graph Neural Network training", <arxivlink>☆24Dec 14, 2020Updated 5 years ago
- ☆11Nov 19, 2020Updated 5 years ago
- This repository contains PyTorch implementations of various random feature maps for dot product kernels.☆22Jul 13, 2024Updated last year
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Aug 17, 2021Updated 4 years ago
- Implementation of the Rotation Forest by Rodriques et al. 2006☆28Feb 6, 2024Updated 2 years ago
- Sinkhorn Barycenters via Frank-Wolfe algorithm☆27Feb 3, 2020Updated 6 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- ☆18Apr 16, 2021Updated 4 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Apr 7, 2021Updated 4 years ago
- ☆10Oct 2, 2024Updated last year