Fast bare-bones BPE for modern tokenizer training
☆179Jun 23, 2025Updated 11 months ago
Alternatives and similar repositories for bpeasy
Users that are interested in bpeasy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆150Nov 11, 2024Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,693May 30, 2025Updated last year
- ☆10Sep 30, 2015Updated 10 years ago
- RuLES: a benchmark for evaluating rule-following in language models☆255Feb 24, 2025Updated last year
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,060Apr 27, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ScriptBots is an Open Source Evolutionary Artificial Life Simulation of Predator-Prey dynamics, written by Andrej Karpathy.☆66Feb 18, 2011Updated 15 years ago
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- ####### ALERT! #########: my fork of the project has moved:☆18Dec 23, 2016Updated 9 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆603May 13, 2026Updated last month
- A basic implementation of convolutional neural nets☆59Apr 20, 2014Updated 12 years ago
- useful scripts to work with Twitter + Python. Requires the tweepy library.☆88Nov 29, 2012Updated 13 years ago
- Implements SFO minibatch optimizer in Python and MATLAB, and reproduces figures from paper.☆136May 17, 2021Updated 5 years ago
- JavaScript with Batteries Included for Google Glass☆219Jul 10, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Extracts plain text, language identification and more metadata from WARC records☆23Apr 16, 2026Updated 2 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- Benchmark testbed for assessing the performance of optimisation algorithms☆85Jan 7, 2015Updated 11 years ago
- ☆12Mar 17, 2026Updated 3 months ago
- ☆10Apr 8, 2024Updated 2 years ago
- Schedule-Free Optimization in PyTorch☆2,304Updated this week
- gpt-2 from scratch in mlx☆434Jun 12, 2024Updated 2 years ago
- Code for Zero-Shot Tokenizer Transfer☆145Jan 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Apr 4, 2022Updated 4 years ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆964Nov 16, 2025Updated 7 months ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Feb 9, 2026Updated 4 months ago
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Ruby Gem that makes sure that only a single instance of a code block is running.☆16Mar 13, 2013Updated 13 years ago
- Simple MPI implementation for prototyping or learning☆319Aug 6, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,864Apr 18, 2025Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆317Jun 11, 2024Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,367Jun 13, 2024Updated 2 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆15Aug 17, 2023Updated 2 years ago
- ☆53Feb 10, 2025Updated last year
- ☆55Aug 22, 2025Updated 9 months ago
- Minimalistic large language model 3D-parallelism training☆2,720May 26, 2026Updated 3 weeks ago