Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. Pure JavaScript port of karpathy/minbpe
☆18 Feb 19, 2024 Updated 2 years ago
Alternatives and similar repositories for minbpe
Users that are interested in minbpe are comparing it to the libraries listed below.
- Split large texts into chunks with a maximum number of tokens. Split by fixed size or by sentence. ☆34 Mar 1, 2024 Updated 2 years ago
- A set of UI components to help you integrate Orama on your website or app. ☆17 Jun 19, 2025 Updated 9 months ago
- Minimalist (yet helpful) monorepo manager for Deno ☆18 Dec 13, 2023 Updated 2 years ago
- ☆17 Jan 4, 2023 Updated 3 years ago
- JCS (JSON Canonicalization Scheme), JSON digests, and JSON Merkle hashes ☆16 Mar 10, 2024 Updated 2 years ago
- 🚒✨ Rescue: better errors through types (a more type directed MonadThrow/MonadCatch) ☆20 Feb 1, 2022 Updated 4 years ago
- A set of macros and functions to make defining a C module easier ☆11 Sep 9, 2019 Updated 6 years ago
- A simple, Apollo-based, GraphQL driver to be used with Cycle's most-run ☆13 Aug 7, 2019 Updated 6 years ago
- An application to create nodejs.org distribution index files: index.json and index.tab ☆22 Mar 20, 2026 Updated 3 weeks ago
- A collection of additional language phonology settings for use with VulgarLang. ☆13 Aug 19, 2022 Updated 3 years ago
- A Modeling Notation ꕤ ☆15 Mar 3, 2026 Updated last month
- Non-allocating command line flag parser ☆17 Mar 26, 2026 Updated 2 weeks ago
- unb internet of things home page ☆11 Oct 16, 2025 Updated 5 months ago
- google summer of code repository ☆19 Feb 14, 2026 Updated 2 months ago
- NodeSecure Governance (Code of conduct & Contribution guidelines) ☆16 Apr 5, 2026 Updated last week
- ☆11 Jul 25, 2016 Updated 9 years ago
- List key diff algorithm. ☆15 Nov 10, 2015 Updated 10 years ago
- ☆17 Feb 27, 2025 Updated last year
- ☆11 Jul 4, 2018 Updated 7 years ago
- A website that lets you know where to watch a movie built on Next.js and Meilisearch, deployed on Vercel with the Meilisearch + Vercel in… ☆12 Aug 10, 2023 Updated 2 years ago
- Scroll to the current anchor in the url if possible ☆11 Jun 18, 2017 Updated 8 years ago
- A small utility for creating warnings and emitting them ☆38 Apr 8, 2026 Updated last week
- 📤 Magically generate `fetch` types from OpenAPI schemas for zero-cost browser-native api clients ☆21 Apr 17, 2025 Updated 11 months ago
- Parse Dat protocol SLEEP files. ☆13 Apr 28, 2021 Updated 4 years ago
- A remark plugin to add metadata about headings in the file ☆17 Oct 14, 2022 Updated 3 years ago
- A suffix trie written in JavaScript ☆10 Jul 26, 2021 Updated 4 years ago
- Generate typescript model definitions with just the JSON schema (including reference resolution). ☆14 Feb 1, 2021 Updated 5 years ago
- 📖 CLI for dynamic runbooks: a structured and auditable approach to creating and executing operational procedures, bridging the gap betwe… ☆19 Mar 7, 2026 Updated last month
- ☆43 Mar 6, 2023 Updated 3 years ago
- an animated Mayan Calendar ☆12 Jan 12, 2014 Updated 12 years ago
- LinguaLibre – Massive Open Audio Recording system ☆13 Jan 19, 2021 Updated 5 years ago
- Implementation of "practical type inference for arbitrary-rank types" in Javascript ☆12 Mar 27, 2019 Updated 7 years ago
- (Alternative) Visualizer for XState ☆12 Mar 2, 2023 Updated 3 years ago
- AWS CDK constructs for AWS ControlTower ☆19 Updated this week
- Node 18's node:test, as an npm package ☆99 Dec 21, 2024 Updated last year
- ☆84 Apr 23, 2025 Updated 11 months ago
- A simple HTML avatar generator for Habbo. ☆10 Jun 29, 2016 Updated 9 years ago
- Photos of U.S. congressional representatives and scrapers used to collect them. ☆14 Jan 30, 2026 Updated 2 months ago
- HTML5 cache manifest generation. ☆35 Feb 4, 2012 Updated 14 years ago