bminixhofer / tokenkitLinks

A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.

☆33

Alternatives and similar repositories for tokenkit

Users that are interested in tokenkit are comparing it to the libraries listed below

Sorting:

bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 2 months ago
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆112Updated 5 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆67Updated 3 months ago
allenai / infinigram-api
☆70Updated this week
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 10 months ago
vicksEmmanuel / latent-gemma
☆26Updated 6 months ago
Zyphra / Zyda_processing
☆36Updated last year
arcee-ai / DAM
☆53Updated 8 months ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆33Updated 4 months ago
PythonNut / superbpe
Official code release for "SuperBPE: Space Travel for Language Models"
☆61Updated this week
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26Updated 5 months ago
mungg / FABLES
☆57Updated 9 months ago
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆105Updated 4 months ago
SalesforceAIResearch / GemFilter
☆82Updated 6 months ago
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 4 months ago
euclaise / supertrainer2000
☆49Updated last year
TristanThrush / i-am-a-strange-dataset
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆44Updated last year
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆205Updated last year
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆60Updated 9 months ago
marzenakrp / nocha
☆52Updated 8 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 9 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117Updated this week
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆194Updated 2 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆132Updated last year