javirandor / anthropic-tokenizer
Approximation of the Claude 3 tokenizer by inspecting generation stream
☆151Jul 22, 2024Updated last year
Alternatives and similar repositories for anthropic-tokenizer
Users that are interested in anthropic-tokenizer are comparing it to the libraries listed below
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 10 months ago
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated 2 weeks ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 5 months ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- ☆26Feb 11, 2025Updated last year
- A website for easily verifying phone numbers on the BizM development server.☆10Feb 27, 2023Updated 2 years ago
- Make running benchmarks simple yet maintainable again. Currently supports only Korean cross-encoders.☆29Dec 2, 2025Updated 2 months ago
- [Suspended] Modern, customizable AI character frontend for enthusiasts (inspired by SillyTavern)☆10Nov 8, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 8 months ago
- ☆12Apr 17, 2024Updated last year
- Evaluation of Korean language models using a self-built Korean evaluation dataset☆31May 31, 2024Updated last year
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- ☆17Jun 23, 2025Updated 7 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated 11 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 6 months ago
- Auto-generate programs in C derived languages for multiple platforms☆18Aug 4, 2025Updated 6 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Jan 6, 2025Updated last year
- VSCode extension for working with Architecture As A Code in the C4 model. Includes syntax highlighting, diagram preview, and tools for wo…☆31Feb 6, 2026Updated last week
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests☆17Oct 28, 2025Updated 3 months ago
- ⚡Library for easy interaction with RabbitMQ 🐰☆19Oct 13, 2023Updated 2 years ago
- Code for the paper "Fishing for Magikarp"☆180May 15, 2025Updated 9 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆87Mar 14, 2022Updated 3 years ago
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms.☆29Nov 22, 2024Updated last year
- A pytest plugin for running and analyzing LLM evaluation tests.☆153Feb 5, 2025Updated last year
- Gugugo: a Korean open-source translation model project☆85Apr 7, 2024Updated last year
- Efficient vector database for hundred millions of embeddings.☆211May 17, 2024Updated last year
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 6 months ago
- Gradient Boosting Models on Real-Time Sensor Data for AI-Enhanced Vehicle Predictive Maintenance. By using a web-based interface to forec…☆19Nov 17, 2024Updated last year
- 💬 Minimalistic repository to reproduce and serve CAMEL models.☆25Jun 26, 2023Updated 2 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Bash functions-as-a-service☆31Dec 20, 2024Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆31Jul 12, 2025Updated 7 months ago
- Rust-based Lua runtime for cryptocurrency trading☆22Jun 14, 2025Updated 8 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆60Jun 20, 2024Updated last year