helpmefindaname/transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and speed up training.

☆23

Alternatives and similar repositories for transformer-smaller-training-vocab

Users that are interested in transformer-smaller-training-vocab are comparing it to the libraries listed below.

yuweihao / LV-BERT
LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)
☆18 · May 10, 2023 · Updated 3 years ago
ahmetustun / hyperx
☆21 · Dec 5, 2022 · Updated 3 years ago
UniversalNER / UniversalNER
☆28 · Apr 19, 2026 · Updated 3 months ago
gautierdag / tokenizer-bench
Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"
☆22 · Feb 14, 2024 · Updated 2 years ago
coastalcph / histnorm
Compiled tools, datasets, and other resources for historical text normalization.
☆21 · Jun 18, 2019 · Updated 7 years ago
cisnlp / MEXA
[ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆11 · Apr 6, 2025 · Updated last year
LZhengisme / CODA
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11 · Sep 16, 2021 · Updated 4 years ago
marcotchen / SimpleGPT
[ICML 2026] Improving GPT via a simple normalization strategy
☆15 · May 22, 2026 · Updated 2 months ago
WalterSimoncini / SeqAttack
A framework for adversarial attacks against token classification models
☆33 · Nov 6, 2021 · Updated 4 years ago
ltgoslo / simple_elmo_training
Minimal code to train ELMo models in recent versions of TensorFlow
☆14 · Jun 16, 2026 · Updated last month
pdufter / densray
Getting interpretable dimensions in word embedding spaces.
☆15 · Jul 6, 2023 · Updated 3 years ago
boschresearch / adversarial_meta_embeddings
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13 · Dec 14, 2021 · Updated 4 years ago
doccano / spacy-partial-tagger
A simple library for training named entity recognition model from partially annotated data
☆24 · Nov 12, 2023 · Updated 2 years ago
melanietosik / maxent-ner-tagger
Maximum entropy named-entity recognition (NER)
☆13 · Dec 8, 2022 · Updated 3 years ago
plassma / symbolic-music-discrete-diffusion
☆50 · Aug 21, 2023 · Updated 2 years ago
applicaai / pyramidions
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14 · May 15, 2022 · Updated 4 years ago
zalandoresearch / SWARM
Set-Equivariant Deep Learning Models
☆22 · Dec 23, 2021 · Updated 4 years ago
xinjli / phonepiece
phone inventory library
☆17 · May 15, 2023 · Updated 3 years ago
rhasspy / phonetisaurus-pypi
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12 · Mar 11, 2021 · Updated 5 years ago
mojsaeed / RuleBert
☆20 · Mar 30, 2022 · Updated 4 years ago
XuhuiZhou / CDA
code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith
☆14 · May 18, 2021 · Updated 5 years ago
YuxianMeng / CorefQA-pytorch
A PyTorch implementation of the CorefQA Model.
☆10 · Jun 27, 2020 · Updated 6 years ago
PacktPublishing / Natural-Language-Processing-with-Flair
Natural Language Processing with Flair, published by Packt
☆26 · Mar 2, 2026 · Updated 4 months ago
yahshibu / nested-ner-tacl2020-flair
Implementation of Nested Named Entity Recognition using Flair
☆24 · Oct 29, 2021 · Updated 4 years ago
nateraw / spaces-docker-templates
🚀🤗 A collection of templates for Hugging Face Spaces
☆35 · Oct 9, 2023 · Updated 2 years ago
AndreasMadsen / course-02456-sparsemax
TensorFlow and Numpy implementation of sparsemax
☆15 · Dec 22, 2019 · Updated 6 years ago
hroptatyr / sample
Produce a sample of lines from files.
☆20 · Jul 2, 2022 · Updated 4 years ago
allenai / staged-training
Staged Training for Transformer Language Models
☆33 · Mar 31, 2022 · Updated 4 years ago
svmiller / stevethemes
Steve's {ggplot2} themes and related theme elements
☆12 · Apr 27, 2023 · Updated 3 years ago
impresso / named-entity-tutorial-dh2019
Tutorial on NE processing for Digital Humanities - DH Utrech 2019
☆24 · Jul 18, 2019 · Updated 7 years ago
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19 · Nov 29, 2022 · Updated 3 years ago
UKP-SQuARE / square-core
SQuARE: Software for question answering research.
☆75 · Jun 25, 2024 · Updated 2 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29 · Apr 17, 2024 · Updated 2 years ago
Uehwan / Incremental-Learning
Incremental Learning with Adaptive Resonance Theory (ART) & Developmental Resonance networks
☆12 · Dec 18, 2019 · Updated 6 years ago
milkymap / map2gpt
This project is a versatile and powerful search tool that leverages state-of-the-art natural language processing models to provide releva…
☆12 · Apr 3, 2023 · Updated 3 years ago
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28 · Oct 3, 2021 · Updated 4 years ago
shavarani / SpEL
Structured Prediction for Entity Linking
☆39 · Aug 2, 2024 · Updated last year
ocropus-archive / DUP-cctc
Simple CTC implementation for PyTorch
☆14 · Oct 25, 2017 · Updated 8 years ago
ZihanWangKi / CrossWeigh
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
☆177 · Jul 25, 2024 · Updated last year