Fast bare-bones BPE for modern tokenizer training
☆178Jun 23, 2025Updated 11 months ago
Alternatives and similar repositories for bpeasy
Users that are interested in bpeasy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆147Nov 11, 2024Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,672May 30, 2025Updated last year
- JAX implementation ViT-VQGAN☆64Jul 23, 2022Updated 3 years ago
- UNet diffusion model in pure CUDA☆657Jun 28, 2024Updated last year
- RuLES: a benchmark for evaluating rule-following in language models☆253Feb 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,058Apr 27, 2025Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ScriptBots is an Open Source Evolutionary Artificial Life Simulation of Predator-Prey dynamics, written by Andrej Karpathy.☆65Feb 18, 2011Updated 15 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,496Jul 1, 2024Updated last year
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600May 13, 2026Updated 2 weeks ago
- ☆20Apr 26, 2026Updated last month
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27May 16, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- useful scripts to work with Twitter + Python. Requires the tweepy library.☆88Nov 29, 2012Updated 13 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆31May 11, 2026Updated 2 weeks ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated last year
- Implements SFO minibatch optimizer in Python and MATLAB, and reproduces figures from paper.☆136May 17, 2021Updated 5 years ago
- JavaScript with Batteries Included for Google Glass☆219Jul 10, 2016Updated 9 years ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- Benchmark testbed for assessing the performance of optimisation algorithms☆85Jan 7, 2015Updated 11 years ago
- ☆12Mar 17, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Apr 8, 2024Updated 2 years ago
- Schedule-Free Optimization in PyTorch☆2,296May 18, 2026Updated last week
- gpt-2 from scratch in mlx☆428Jun 12, 2024Updated last year
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆39Jan 6, 2026Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer☆144Jan 14, 2025Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆959Nov 16, 2025Updated 6 months ago
- Google Mirror API's Quickstart for Python☆351Jun 13, 2021Updated 4 years ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fine-tune mistral-7B on 3090s, a100s, h100s☆728Oct 11, 2023Updated 2 years ago
- Simple MPI implementation for prototyping or learning☆315Aug 6, 2025Updated 9 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,856Apr 18, 2025Updated last year
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆16Sep 23, 2022Updated 3 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,366Jun 13, 2024Updated last year
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated 4 months ago
- Tile primitives for speedy kernels☆3,377May 22, 2026Updated last week