timoschick/one-token-approximation

This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.

☆12

Alternatives and similar repositories for one-token-approximation

Users who are interested in one-token-approximation are comparing it to the repositories listed below.

timoschick / am-for-bert
This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…
☆14 · Feb 2, 2020 · Updated 6 years ago
Jiangtong-Li / Subword-ELMo
☆12 · Mar 20, 2020 · Updated 6 years ago
almogtavor / SE-KD3x
🚄 Fast, accurate & memory efficient LLM distillation via tokens, classes and samples selection
☆20 · Feb 3, 2026 · Updated 5 months ago
CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003
Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
☆12 · May 11, 2021 · Updated 5 years ago
ivanmontero / autobot
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17 · Mar 14, 2022 · Updated 4 years ago
valentinhofmann / superbizarre
Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"
☆18 · Aug 17, 2021 · Updated 4 years ago
xhan77 / AdaptaBERT
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
☆46 · Sep 3, 2019 · Updated 6 years ago
retico-team / retico-core
Core repository of the retico framework providing the basic functionality of incremental processing.
☆12 · May 18, 2026 · Updated 2 months ago
adrianeboyd / boyd-wnut2018
Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)
☆17 · Jul 16, 2024 · Updated 2 years ago
alexa / ramen
A software for transferring pre-trained English models to foreign languages
☆20 · Mar 20, 2023 · Updated 3 years ago
Praznat / annotationmodeling
Aggregation for complex labels, described in https://www.ischool.utexas.edu/~ml/papers/braylan_web2020.pdf
☆16 · Jun 7, 2024 · Updated 2 years ago
ZeweiChu / MQR
☆20 · Nov 24, 2019 · Updated 6 years ago
LEYADEV / Vocabulary-Transfer
Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf
☆20 · Dec 28, 2021 · Updated 4 years ago
IBM / intermediate-training-using-clustering
code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022
☆27 · May 18, 2022 · Updated 4 years ago
dkpro / dkpro-wsd
DKPro WSD: A Java framework for word sense disambiguation
☆21 · Jun 30, 2026 · Updated 2 weeks ago
jixuan-wang / Grad2Task
Codes for the paper "Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation"
☆14 · Nov 24, 2022 · Updated 3 years ago
tksmatsubara / discrete-autograd
Code for "Deep Energy-Based Modeling of Discrete-Time Physics," NeurIPS, 2020. (Oral)
☆19 · Jan 30, 2022 · Updated 4 years ago
maelfabien / build_your_blog.github.io
Template and steps to build your personal blog using Jekyll and Minimal Mistake
☆10 · Feb 24, 2020 · Updated 6 years ago
lanwuwei / BERTOverflow
A Pre-trained BERT on StackOverflow Corpus
☆46 · Feb 27, 2021 · Updated 5 years ago
nstawfik / MedSentEval
☆11 · Nov 19, 2020 · Updated 5 years ago
MichaelLLi / Text_Normalization
A text normalization framework using GBM and human-generated features
☆10 · Feb 4, 2020 · Updated 6 years ago
lyutyuh / gazetteer-NER-acl19
Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers
☆32 · Jul 2, 2021 · Updated 5 years ago
huggingface / model_card
☆30 · Sep 27, 2021 · Updated 4 years ago
nreimers / beir-sparta
Re-Implementation of SPARTA model
☆13 · Oct 1, 2021 · Updated 4 years ago
rachtsingh / lgamma
Implementations of polygamma, lgamma, and beta functions for PyTorch
☆25 · Jul 8, 2017 · Updated 9 years ago
sagagugit / ProBASS
☆13 · May 24, 2026 · Updated last month
lgessler / microbert
A tiny BERT for low-resource monolingual models
☆32 · Dec 24, 2025 · Updated 6 months ago
justusrobertson / GME
The General Mediation Engine (GME) is a software framework for producing interactive narratives using narrative mediation.
☆10 · Feb 4, 2018 · Updated 8 years ago
nealgravindra / self-supervsed_edge_feats
code for "Self-supervised edge features for improved Graph Neural Network training", <arxivlink>
☆24 · Dec 14, 2020 · Updated 5 years ago
jeffeuxMartin / meta-learning-hlp
A publishing website of a table collecting meta-learning-related papers in the area of human language processing.
☆17 · Aug 2, 2021 · Updated 4 years ago
joneswack / dp-rfs
This repository contains PyTorch implementations of various random feature maps for dot product kernels.
☆22 · Jul 13, 2024 · Updated 2 years ago
joshloyal / RotationForest
Implementation of the Rotation Forest by Rodriques et al. 2006
☆28 · Feb 6, 2024 · Updated 2 years ago
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188 · Aug 17, 2021 · Updated 4 years ago
UKPLab / emnlp2021-prompt-ft-heuristics
☆10 · Sep 27, 2021 · Updated 4 years ago
GiulsLu / Sinkhorn-Barycenters
Sinkhorn Barycenters via Frank-Wolfe algorithm
☆26 · Feb 3, 2020 · Updated 6 years ago
huaxiuyao / KGML
KGML for EMNLP 2021
☆10 · Feb 2, 2022 · Updated 4 years ago
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14 · Jul 11, 2023 · Updated 3 years ago
pearapple123 / rime-hoisanva
A RIME IME for Taishanese
☆11 · Aug 3, 2023 · Updated 2 years ago
UKPLab / TWEAC-qa-agent-selection
☆20 · Apr 16, 2021 · Updated 5 years ago