helpmefindaname / transformer-smaller-training-vocabView external linksLinks
Temporary remove unused tokens during training to save ram and speed.
☆23Jun 15, 2025Updated 8 months ago
Alternatives and similar repositories for transformer-smaller-training-vocab
Users that are interested in transformer-smaller-training-vocab are comparing it to the libraries listed below
Sorting:
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆21Feb 14, 2024Updated 2 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 10 months ago
- Implementation of Cascaded Head-colliding Attention (ACL'2021)☆11Sep 16, 2021Updated 4 years ago
- ☆28Feb 24, 2025Updated 11 months ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Maximum entropy named-entity recognition (NER)☆13Dec 8, 2022Updated 3 years ago
- ☆20Mar 30, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- A collection of notebooks for Natural Language Processing☆25Jan 13, 2025Updated last year
- A tool for benchmarking LLMs on Modal☆46Aug 29, 2025Updated 5 months ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.☆30Nov 5, 2021Updated 4 years ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Sep 24, 2025Updated 4 months ago
- A framework for adversarial attacks against token classification models☆33Nov 6, 2021Updated 4 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Dec 14, 2022Updated 3 years ago
- Structured Prediction for Entity Linking☆38Aug 2, 2024Updated last year
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 3 months ago
- ☆75Jul 2, 2021Updated 4 years ago
- ☆35Aug 4, 2021Updated 4 years ago
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- A tutorial on Bayesian multilevel modeling using R and Stan.☆14Nov 19, 2021Updated 4 years ago
- Handles OpenDocument files and translates them to HTML.☆10Oct 8, 2019Updated 6 years ago
- ☆10Oct 2, 2024Updated last year
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024☆13Jun 24, 2024Updated last year
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Nov 15, 2025Updated 3 months ago
- NLP Examples using the 🤗 libraries☆40Feb 21, 2021Updated 4 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- A memory allocator that aims to eliminate dangling pointer vulnerabilities at a low overhead, using virtualisation via Dune. My Computer …☆10Nov 27, 2019Updated 6 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆47Feb 19, 2025Updated 11 months ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- An all-in-one R package for the assessment of linguistic similarity☆11Oct 6, 2025Updated 4 months ago
- Collection of iPython notebooks with some quick demos☆11May 25, 2017Updated 8 years ago