bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆20Updated this week
Alternatives and similar repositories for tokenkit
Users that are interested in tokenkit are comparing it to the libraries listed below
Sorting:
- Code for Zero-Shot Tokenizer Transfer☆127Updated 4 months ago
- ☆57Updated 7 months ago
- Lightweight tools for quick and easy LLM demo's☆26Updated 7 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆97Updated 3 weeks ago
- ☆49Updated 2 months ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆18Updated last month
- Lego for GRPO☆28Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 2 months ago
- Pre-train Static Word Embeddings☆60Updated last month
- ☆51Updated 6 months ago
- ☆48Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated last month
- minimalistic AI library that resembles HF's transformers☆13Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 7 months ago
- ☆25Updated 4 months ago
- ☆33Updated 10 months ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆82Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- This is the official repository for Inheritune.☆111Updated 3 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated 2 months ago
- An introduction to LLM Sampling☆78Updated 5 months ago
- ☆39Updated last week
- ☆47Updated 8 months ago
- ☆43Updated 3 months ago
- ☆57Updated this week