mithril-security / tokenizers-wasmLinks
wasm bindings for huggingface tokenizers library
☆34Updated 3 years ago
Alternatives and similar repositories for tokenizers-wasm
Users that are interested in tokenizers-wasm are comparing it to the libraries listed below
Sorting:
- ☆39Updated 3 years ago
- Simple high-throughput inference library☆155Updated 8 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆42Updated 4 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆58Updated 8 months ago
- Modular Rust transformer/LLM library using Candle☆38Updated last year
- Python bindings for ggml☆147Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Web browser version of StarCoder.cpp☆46Updated 2 years ago
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆66Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- ☆157Updated 2 years ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆230Updated last month
- ☆49Updated 5 years ago
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- GGUF parser in Python☆28Updated last year
- ☆135Updated last year
- Inference of Mamba and Mamba2 models in pure C☆196Updated 2 weeks ago
- Make triton easier☆50Updated last year
- Your one stop CLI for ONNX model analysis.☆47Updated 3 years ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆40Updated 2 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21Updated 8 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- ☆140Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65Updated 2 years ago
- Run ONNX and TensorFlow inference in the browser.☆75Updated 3 years ago
- tinygrad port of the RWKV large language model.☆45Updated 11 months ago