Fast tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and WordPiece tokenization in JavaScript, Python and Rust.
☆49Apr 26, 2026Updated last week
Alternatives and similar repositories for kitoken
Users that are interested in kitoken are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PHP low-level client for Vespa. https://vespa.ai/☆17Jan 22, 2026Updated 3 months ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆28Jun 9, 2025Updated 10 months ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 11 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 7 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- gRPC server for hnswlib☆16Mar 6, 2023Updated 3 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- A password validation and generation tool kit☆13Jan 7, 2023Updated 3 years ago
- Typescript declarations for the current live World of Warcraft Classic LUA API☆10May 7, 2023Updated 2 years ago
- ☆25May 14, 2025Updated 11 months ago
- Universal Utility Toolkit☆20Oct 12, 2024Updated last year
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆23Jan 2, 2026Updated 4 months ago
- 🧬🔍 Vecgo is a pure Go, embeddable, hybrid vector database designed for high-performance production workloads. It combines commit-orient…☆15Jan 19, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Private self-improvement coaching with open-source LLMs☆17Mar 7, 2024Updated 2 years ago
- CMS 230 - Computer Organization and Architecture☆10Sep 6, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- Fluent dreaming for language models☆13Jul 22, 2024Updated last year
- Website for TREC RAG☆14Apr 24, 2026Updated last week
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.☆23Nov 9, 2025Updated 5 months ago
- Unofficial Claude Desktop for Linux☆23Mar 27, 2026Updated last month
- Website for Applied-LLMs work☆29Jan 13, 2026Updated 3 months ago
- An R package to convert SingeCellExperiment and Seurat objects into anndata as comprehensively as possible.☆11Apr 23, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Guichan is a C++ GUI library designed for games.☆14Oct 22, 2025Updated 6 months ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- A safer, drop-in replacement for Go's syscall/js JavaScript package.☆19Mar 5, 2023Updated 3 years ago
- Certificate transparency SCT verification library in rust☆55Mar 31, 2026Updated last month
- Authentication Callout Library☆22Apr 20, 2026Updated 2 weeks ago
- Tulp is a command-line tool that can help you create and process piped content using the power of ChatGPT directly from the terminal.☆20Jan 20, 2026Updated 3 months ago
- Handling of multiple types of media documents for Django☆28Nov 9, 2015Updated 10 years ago
- Almost-Pure Rust TTS Engine for my Rustnation talk☆51Mar 10, 2026Updated last month
- Simple but reliable memory allocator for embedded Rust and #![no_std]☆14Sep 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Web client for Vespa.ai☆57Jul 2, 2025Updated 10 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆48Feb 18, 2025Updated last year
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- A Python package for PME (Public Market Equivalent) calculation☆13Jan 16, 2026Updated 3 months ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narhwal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago
- A Rust hash table using 8-way hopscotch hashing with constant-time worst-case lookups, and SIMD acceleration☆23Oct 4, 2025Updated 7 months ago
- This is a generic Python client for BioThings APIs☆20Mar 31, 2026Updated last month