Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆42Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for minbpe-pytorch
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below.
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆12Oct 23, 2023Updated 2 years ago
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆15Jan 12, 2026Updated 3 months ago
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- Customizable charts made with TikZ and LaTeX3☆14Feb 11, 2023Updated 3 years ago
- An analog touch screen joystick that pretends to be a bevy gamepad☆13Jul 13, 2024Updated last year
- A small game demonstrating a grid distortion effect☆15Oct 5, 2021Updated 4 years ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- Bindings to Nvidia Labs's ꟻLIP image comparison and error visualization library☆22Apr 13, 2026Updated last week
- Transcribe with ease :D☆16Jun 21, 2023Updated 2 years ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆23Oct 31, 2024Updated last year
- GPT4 Tokenizer Visualizer☆23May 21, 2023Updated 2 years ago
- Implementation of a multi-agent system for the modeling of carpooling in a city with one-way streets. Used Python and the Mesa package fo…☆14Jan 19, 2022Updated 4 years ago
- ☆28Jul 29, 2025Updated 8 months ago
- ☆22Sep 9, 2021Updated 4 years ago
- Rapid prototyping GUI, and visual printf-style debugging for computer vision development.☆25Apr 18, 2022Updated 4 years ago
- Tools to analyse and experiment with ActivityWatch data☆35Jun 25, 2024Updated last year
- IonSolver is a magnetohydrodynamic simulation software featuring an extended Lattice Boltzmann method and GPU acceleration☆22Nov 10, 2025Updated 5 months ago
- ChatGPT-History-Downloader is a Chrome/Edge extension to help download chat history with OpenAI ChatGPT☆29Aug 23, 2025Updated 7 months ago
- The Parrot stable and deterministic multi-threading system.☆25Nov 9, 2013Updated 12 years ago
- Visual Transformer Mechanistic Analysis Tool☆36Jun 3, 2023Updated 2 years ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆40Aug 2, 2023Updated 2 years ago
- Micro-framework for publishing linked data☆11Aug 1, 2017Updated 8 years ago
- ☆35Sep 13, 2023Updated 2 years ago
- ☆33Jan 13, 2022Updated 4 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- Export NetworkX graphs to TikZ directly☆31May 7, 2025Updated 11 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆77Apr 14, 2026Updated last week
- Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations".☆11Dec 9, 2020Updated 5 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText☆10Sep 3, 2019Updated 6 years ago
- Python library for analyzing, exploring, and visualizing epitrochoids and hypotrochoids in just a few lines of code☆31Jul 7, 2023Updated 2 years ago
- Analyse (group)chat messages. Currently supports: Facebook Messenger. Planned: Signal, Discord, WhatsApp☆41Jul 18, 2022Updated 3 years ago
- GraphicalDebugging extension for Visual Studio Code☆47Mar 24, 2026Updated 3 weeks ago
- ☆13Oct 5, 2020Updated 5 years ago
- 3d Cellular Automata using WGPU in Rust (for the web and using compute shaders)☆37Mar 4, 2025Updated last year
- Topological Neuron Synthesis☆42Feb 26, 2025Updated last year
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"☆11Jun 23, 2016Updated 9 years ago
- Python implementation of the random-walk inductive classification algorithm Modified Adsorption from P. Talukdar☆15Jul 30, 2014Updated 11 years ago