Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆42 · Feb 27, 2024 · Updated 2 years ago
Alternatives and similar repositories for minbpe-pytorch
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below.
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations. ☆13 · Jan 6, 2023 · Updated 3 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 · Aug 29, 2023 · Updated 2 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons ☆13 · Feb 13, 2023 · Updated 3 years ago
- Customizable charts made with TikZ and LaTeX3 ☆14 · Feb 11, 2023 · Updated 3 years ago
- ☆12 · Oct 23, 2022 · Updated 3 years ago
- A small game demonstrating a grid distortion effect ☆15 · Oct 5, 2021 · Updated 4 years ago
- A command line utility for doing polarization simulations ☆17 · Aug 21, 2019 · Updated 6 years ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported. ☆23 · Oct 31, 2024 · Updated last year
- Transcribe with ease :D ☆16 · Jun 21, 2023 · Updated 2 years ago
- A visionOS project that demonstrates how to scale a volume to account for Window Zoom changes ☆18 · Apr 3, 2024 · Updated last year
- gpt completions in vscode ☆35 · Mar 24, 2023 · Updated 3 years ago
- GPT4 Tokenizer Visualizer ☆23 · May 21, 2023 · Updated 2 years ago
- Implementation of a multi-agent system for the modeling of carpooling in a city with one-way streets. Used Python and the Mesa package fo… ☆14 · Jan 19, 2022 · Updated 4 years ago
- ☆28 · Jul 29, 2025 · Updated 7 months ago
- This is a public repository for collecting excellent visualizations of knowledge and/or data. ☆17 · Jan 6, 2021 · Updated 5 years ago
- ☆30 · Jul 17, 2023 · Updated 2 years ago
- Rapid prototyping GUI, and visual printf-style debugging for computer vision development. ☆25 · Apr 18, 2022 · Updated 3 years ago
- IonSolver is a magnetohydrodynamic simulation software featuring an extended Lattice Boltzmann method and GPU acceleration ☆22 · Nov 10, 2025 · Updated 4 months ago
- ChatGPT-History-Downloader is a Chrome/Edge extension to help download chat history with OpenAI ChatGPT ☆29 · Aug 23, 2025 · Updated 7 months ago
- The Parrot stable and deterministic multi-threading system. ☆25 · Nov 9, 2013 · Updated 12 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆35 · Oct 28, 2025 · Updated 4 months ago
- Windbg Utility Tools based upon PyKD ☆42 · Sep 9, 2020 · Updated 5 years ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆40 · Aug 2, 2023 · Updated 2 years ago
- Micro-framework for publishing linked data ☆11 · Aug 1, 2017 · Updated 8 years ago
- ☆35 · Sep 13, 2023 · Updated 2 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation ☆11 · Mar 7, 2023 · Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆75 · Mar 21, 2026 · Updated last week
- Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations". ☆11 · Dec 9, 2020 · Updated 5 years ago
- Real probability scales for matplotlib ☆40 · Nov 3, 2023 · Updated 2 years ago
- Background daemon which archives a list of URLs to the Internet Archive, archive.is, and other services ☆62 · Sep 12, 2023 · Updated 2 years ago
- Analyse (group)chat messages. Currently supports: Facebook Messenger. Planned: Signal, Discord, WhatsApp ☆42 · Jul 18, 2022 · Updated 3 years ago
- GraphicalDebugging extension for Visual Studio Code ☆47 · Updated this week
- Rule-based Kurdish Transliterator ☆10 · May 3, 2024 · Updated last year
- Import data from Apple's Screen Time on macOS and iOS to ActivityWatch ☆55 · Oct 26, 2025 · Updated 5 months ago
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max). ☆25 · Dec 16, 2023 · Updated 2 years ago
- A DLL load tracing tool for CPython ☆64 · Oct 23, 2024 · Updated last year
- Official Implementation of "The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thou… ☆14 · Jul 2, 2025 · Updated 8 months ago
- Auditing agents for fine-tuning safety ☆20 · Oct 21, 2025 · Updated 5 months ago
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis. ☆13 · Mar 20, 2025 · Updated last year