mithril-security / tokenizers-wasm
Wasm bindings for the Hugging Face tokenizers library
☆35 Updated 2 years ago
Alternatives and similar repositories for tokenizers-wasm:
Users who are interested in tokenizers-wasm compare it to the libraries listed below.
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- ☆37 Updated 2 years ago
- ANE accelerated embedding models! ☆17 Updated 3 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆54 Updated last year
- ☆25 Updated 3 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆75 Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust ☆47 Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets. ☆46 Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆31 Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- ☆153 Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆63 Updated last year
- Modular Rust transformer/LLM library using Candle ☆36 Updated 10 months ago
- Latent Large Language Models ☆17 Updated 6 months ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra… ☆64 Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆124 Updated 2 months ago
- Vector Database with support for late interaction and token level embeddings. ☆53 Updated 5 months ago
- experiments with inference on llama ☆104 Updated 9 months ago
- webassembly binding for Hora Approximate Nearest Neighbor Search Library ☆55 Updated 3 years ago
- ☆26 Updated 2 years ago
- Make triton easier ☆47 Updated 9 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆18 Updated 11 months ago
- Structured inference with Llama 2 in your browser ☆52 Updated 4 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆24 Updated last year
- ☆32 Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated last year