kuprel / minbpe-pytorch
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆35Updated 6 months ago
Related projects: ⓘ
- NLP with Rust for Python 🦀🐍☆57Updated 3 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 5 months ago
- ☆48Updated 6 months ago
- Latent Large Language Models☆16Updated 3 weeks ago
- ☆19Updated last month
- ☆65Updated 2 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 7 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆96Updated last week
- ☆34Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 9 months ago
- ☆40Updated 2 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- ☆25Updated this week
- ☆21Updated 3 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Updated 8 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 2 months ago
- Make triton easier☆39Updated 3 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆40Updated 2 weeks ago
- Simple and fast low-bit matmul kernels in CUDA☆48Updated this week
- A place to store reusable transformer components of my own creation or found on the interwebs☆43Updated 3 weeks ago
- ☆29Updated 3 weeks ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Github repo for Peifeng's internship project☆12Updated 10 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆94Updated 2 weeks ago
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 2 months ago
- ☆36Updated last year
- ☆62Updated 5 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year