mithril-security / tokenizers-wasmLinks
wasm bindings for huggingface tokenizers library
☆34Updated 3 years ago
Alternatives and similar repositories for tokenizers-wasm
Users that are interested in tokenizers-wasm are comparing it to the libraries listed below
Sorting:
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- ☆39Updated 3 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆39Updated 2 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year
- ☆140Updated last year
- Python bindings for ggml☆146Updated last year
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆217Updated 2 months ago
- Web browser version of StarCoder.cpp☆45Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Run ONNX and TensorFlow inference in the browser.☆75Updated 2 years ago
- ☆157Updated 2 years ago
- Simple high-throughput inference library☆150Updated 6 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21Updated 6 months ago
- ☆135Updated last year
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 5 months ago
- implement llava using candle☆15Updated last year
- Like picoGPT but for BERT.☆51Updated 2 years ago
- Locality Sensitive Hashing☆76Updated 2 years ago
- Experiments on speculative sampling with Llama models☆127Updated 2 years ago
- experiments with inference on llama☆103Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65Updated 2 years ago
- SGLang is fast serving framework for large language models and vision language models.☆30Updated 2 weeks ago
- Using Large Language Models for Repo-wide Type Prediction☆112Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆106Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Updated last year
- ☆198Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year