gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆175Jun 23, 2025Updated 7 months ago
Alternatives and similar repositories for bpeasy
Users who are interested in bpeasy are comparing it to the libraries listed below
- Simple byte pair encoding mechanism used for tokenization, written purely in C☆146Nov 11, 2024Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,602May 30, 2025Updated 8 months ago
- ☆10Apr 8, 2024Updated last year
- JAX implementation of ViT-VQGAN☆63Jul 23, 2022Updated 3 years ago
- UNet diffusion model in pure CUDA☆661Jun 28, 2024Updated last year
- Google+ Blog☆15Oct 9, 2011Updated 14 years ago
- GPT for FACodec☆13Mar 25, 2024Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27May 16, 2024Updated last year
- ☆19Sep 16, 2025Updated 4 months ago
- ☆16Apr 4, 2022Updated 3 years ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,042Apr 27, 2025Updated 9 months ago
- ☆16Dec 31, 2021Updated 4 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,309Jul 1, 2024Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆596Aug 12, 2025Updated 6 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- BPE modification that removes intermediate tokens during tokenizer training.☆26Nov 25, 2024Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- Pure Python version of the mlabwrap Python to Matlab bridge☆31Nov 21, 2019Updated 6 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Updated this week
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- ☆10Oct 2, 2024Updated last year
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- ScriptBots is an Open Source Evolutionary Artificial Life Simulation of Predator-Prey dynamics, written by Andrej Karpathy.☆62Feb 18, 2011Updated 14 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆156Jul 14, 2025Updated 7 months ago
- BH hackathon☆14Apr 4, 2024Updated last year
- ☆16Feb 18, 2024Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 8 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Aug 6, 2025Updated 6 months ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 10 months ago
- ☆13May 30, 2024Updated last year
- ☆12Feb 22, 2024Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- Using Fourier interpolation to merge large language models☆11Jan 6, 2026Updated last month
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- Schedule-Free Optimization in PyTorch☆2,256May 21, 2025Updated 8 months ago
- Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆32Jan 6, 2026Updated last month
- Minimalistic large language model 3D-parallelism training☆2,544Dec 11, 2025Updated 2 months ago
- Manipulating semantic data within Python☆18Jan 14, 2025Updated last year