marta1994 / efficient_bpe_explanation
This repository provides a clear, educational implementation of Byte Pair Encoding (BPE) tokenization in plain Python. The focus is on algorithmic understanding, not raw performance.
☆14Updated last year
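To give a sense of what such an educational implementation covers, here is a minimal sketch of the classic BPE training loop in plain Python. This is an illustrative assumption about the algorithm the repository explains, not its actual code: repeatedly count adjacent symbol pairs across the corpus and merge the most frequent pair into a new symbol.

```python
from collections import Counter

def get_pair_counts(words):
    # Count adjacent symbol pairs across the corpus, weighted by word frequency.
    counts = Counter()
    for word, freq in words.items():
        for a, b in zip(word, word[1:]):
            counts[(a, b)] += freq
    return counts

def merge_pair(words, pair):
    # Replace every occurrence of `pair` with a single concatenated symbol.
    merged = Counter()
    for word, freq in words.items():
        out, i = [], 0
        while i < len(word):
            if i + 1 < len(word) and (word[i], word[i + 1]) == pair:
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged[tuple(out)] += freq
    return merged

def train_bpe(corpus, num_merges):
    # Start from words split into single-character symbols.
    words = Counter(tuple(w) for w in corpus.split())
    merges = []
    for _ in range(num_merges):
        counts = get_pair_counts(words)
        if not counts:
            break
        best = max(counts, key=counts.get)  # most frequent adjacent pair
        merges.append(best)
        words = merge_pair(words, best)
    return merges

merges = train_bpe("low low low lower lowest", 3)
print(merges)  # → [('l', 'o'), ('lo', 'w'), ('low', 'e')]
```

This naive version rescans the whole corpus for every merge, which is O(corpus size) per merge; the efficiency angle a repository like this would explore is maintaining incremental pair counts so each merge only touches the affected words.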
Alternatives and similar repositories for efficient_bpe_explanation
Users interested in efficient_bpe_explanation are comparing it to the libraries listed below.
- Interpretability for sequence generation models 🐛 🔍☆453Updated last week
- Llama from scratch, or How to implement a paper without crying☆584Updated last year
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆408Updated 2 years ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆829Updated 6 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆116Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆260Updated 2 years ago
- The repository for the code of the UltraFastBERT paper☆518Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆616Updated last year
- List of papers on hallucination detection in LLMs.☆1,041Updated last month
- What would you do with 1000 H100s...☆1,151Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆498Updated 2 years ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆857Updated 2 weeks ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,019Updated last year
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆538Updated 2 years ago
- Puzzles for exploring transformers☆386Updated 2 years ago
- Sparsify transformers with SAEs and transcoders☆692Updated this week
- Prune transformer layers☆74Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer☆567Updated 6 months ago
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany☆52Updated last year
- LLM Workshop by Sourab Mangrulkar☆401Updated last year
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆382Updated 7 months ago
- Best practices for distilling large language models.☆604Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Updated 11 months ago
- Best practices & guides on how to write distributed pytorch training code☆576Updated 3 months ago
- Automatically split your PyTorch models on multiple GPUs for training & inference☆656Updated 2 years ago
- Chat Templates for 🤗 HuggingFace Large Language Models☆713Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- Highly commented implementations of Transformers in PyTorch☆138Updated 2 years ago
- Easily embed, cluster and semantically label text datasets☆592Updated last year