Fast bare-bones BPE for modern tokenizer training
☆177 · Jun 23, 2025 · Updated 9 months ago
Alternatives and similar repositories for bpeasy
Users who are interested in bpeasy are comparing it to the libraries listed below.
- The official PyTorch implementation of Google's Gemma models ☆5,655 · May 30, 2025 · Updated 10 months ago
- JAX implementation ViT-VQGAN ☆63 · Jul 23, 2022 · Updated 3 years ago
- UNet diffusion model in pure CUDA ☆657 · Jun 28, 2024 · Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆248 · Feb 24, 2025 · Updated last year
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆1,053 · Apr 27, 2025 · Updated 11 months ago
- Pure Python version of the mlabwrap Python to Matlab bridge ☆31 · Nov 21, 2019 · Updated 6 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,417 · Jul 1, 2024 · Updated last year
- GPT for FACodec ☆13 · Mar 25, 2024 · Updated 2 years ago
- ####### ALERT! #########: my fork of the project has moved: ☆17 · Dec 23, 2016 · Updated 9 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆600 · Aug 12, 2025 · Updated 8 months ago
- ☆19 · Sep 16, 2025 · Updated 7 months ago
- Teardown of Google Glass ☆39 · Jan 11, 2014 · Updated 12 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation" ☆22 · Feb 14, 2024 · Updated 2 years ago
- A basic implementation of convolutional neural nets ☆59 · Apr 20, 2014 · Updated 11 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 · May 16, 2024 · Updated last year
- 0-Shot Tokenizer Transplant ☆14 · May 16, 2025 · Updated 11 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- Extracts plain text, language identification and more metadata from WARC records ☆23 · Oct 1, 2025 · Updated 6 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆29 · Apr 17, 2024 · Updated 2 years ago
- Benchmark testbed for assessing the performance of optimisation algorithms ☆86 · Jan 7, 2015 · Updated 11 years ago
- ☆12 · Mar 17, 2026 · Updated last month
- ☆10 · Apr 8, 2024 · Updated 2 years ago
- Schedule-Free Optimization in PyTorch ☆2,274 · May 21, 2025 · Updated 10 months ago
- gpt-2 from scratch in mlx ☆423 · Jun 12, 2024 · Updated last year
- Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec… ☆36 · Jan 6, 2026 · Updated 3 months ago
- Code for Zero-Shot Tokenizer Transfer ☆144 · Jan 14, 2025 · Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆957 · Nov 16, 2025 · Updated 5 months ago
- Supervoice diffusion enhance ☆28 · Jul 15, 2024 · Updated last year
- ☆10 · Oct 2, 2024 · Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆46 · Feb 9, 2026 · Updated 2 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆726 · Oct 11, 2023 · Updated 2 years ago
- Measuring if attention is explanation with ROAR ☆22 · Mar 3, 2023 · Updated 3 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ☆75 · Aug 2, 2024 · Updated last year
- Ruby Gem that makes sure that only a single instance of a code block is running. ☆16 · Mar 13, 2013 · Updated 13 years ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,834 · Apr 18, 2025 · Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX ☆311 · Jun 11, 2024 · Updated last year
- ☆12 · Jun 27, 2024 · Updated last year
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,362 · Jun 13, 2024 · Updated last year