kuprel / minbpe-pytorch
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆36Updated last year
Alternatives and similar repositories for minbpe-pytorch:
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- ☆35Updated 2 years ago
- Because it's there.☆16Updated 7 months ago
- Lightweight tools for quick and easy LLM demo's☆26Updated 7 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Port of Facebook's LLaMA model in C/C++☆20Updated last year
- ☆49Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 8 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Experiments for efforts to train a new and improved t5☆77Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 5 months ago
- Latent Large Language Models☆17Updated 7 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Updated last year
- ☆39Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- Training hybrid models for dummies.☆20Updated 3 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆53Updated last year
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆78Updated 9 months ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆21Updated 3 months ago
- Make triton easier☆47Updated 10 months ago
- Chat Markup Language conversation library☆55Updated last year
- ☆53Updated 11 months ago
- ☆63Updated 6 months ago