mithril-security / tokenizers-wasmLinks
wasm bindings for huggingface tokenizers library
☆34Updated 2 years ago
Alternatives and similar repositories for tokenizers-wasm
Users that are interested in tokenizers-wasm are comparing it to the libraries listed below
Sorting:
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- ☆39Updated 2 years ago
- ☆49Updated 5 years ago
- utilities for loading and running text embeddings with onnx☆44Updated 10 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆26Updated 3 months ago
- Because it's there.☆16Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆177Updated this week
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- ANE accelerated embedding models!☆18Updated 6 months ago
- ☆26Updated 6 months ago
- Sentence Embedding as a Service☆15Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆20Updated last month
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated 2 years ago
- Modular Rust transformer/LLM library using Candle☆36Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Web browser version of StarCoder.cpp☆45Updated last year
- ☆35Updated 2 years ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆53Updated last month
- Parallel wasm Barnes-Hut t-SNE implementation written in Rust.☆21Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- ☆11Updated 4 months ago
- ☆26Updated 2 years ago
- Simple high-throughput inference library☆119Updated last month
- Rust crate for some audio utilities☆24Updated 3 months ago
- Training code for Sparse Autoencoders on Embedding models☆38Updated 3 months ago
- Like picoGPT but for BERT.☆50Updated 2 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆45Updated last year