serega / gaoya
Locality Sensitive Hashing
☆73Updated last year
Alternatives and similar repositories for gaoya:
Users that are interested in gaoya are comparing it to the libraries listed below
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆76Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆83Updated last month
- Inference engine for GLiNER models, in Rust☆57Updated last month
- Modular Rust transformer/LLM library using Candle☆36Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆51Updated 4 months ago
- Library for fast text representation and classification.☆28Updated last year
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆40Updated 3 years ago
- Rust binding to crfsuite☆25Updated 3 years ago
- Efficient BM25 with DuckDB 🦆☆48Updated 4 months ago
- ☆29Updated 5 months ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆24Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆62Updated 11 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- ☆58Updated 2 years ago
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)☆114Updated 7 months ago
- Locality Sensitive Hashing in Rust with Python bindings☆115Updated last year
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆29Updated last week
- Rust port of https://github.com/UKPLab/sentence-transformers☆28Updated 5 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆131Updated 4 months ago
- A small rust-based data loader☆24Updated 4 months ago
- fastText Rust binding☆59Updated last year
- ☆129Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆22Updated last month
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- The pipeline for the OSCAR corpus☆168Updated last year
- 8-bit floating point types for Rust☆47Updated last month
- Tree-based indexes for neural-search☆31Updated last year
- 🐍 Python bidding for the Hora Approximate Nearest Neighbor Search Algorithm library☆72Updated 3 years ago
- ☆69Updated 4 months ago
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Updated 4 years ago