softwaredoug / np-sims
numpy ufuncs for vector similarity
☆14 · Updated last year
Alternatives and similar repositories for np-sims
Users who are interested in np-sims are comparing it to the libraries listed below.
- Neural Solr = Solr 9 + Mighty Inference + Node ☆17 · Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Updated 2 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆24 · Updated last month
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 · Updated 4 years ago
- Documentation effort for the BookCorpus dataset ☆34 · Updated 3 years ago
- ☆16 · Updated 4 years ago
- Graph Engine for Exploration and Search ☆40 · Updated last year
- A fork of llama3.c used to do some R&D on inferencing ☆21 · Updated 4 months ago
- Lossless normalization of uppercase characters ☆11 · Updated last year
- HNSW module for Redis ☆57 · Updated 4 years ago
- Hybrid Search (BM25 & Vector) with SQLite ☆15 · Updated 9 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- Cortex-compatible model server for Python and TensorFlow ☆17 · Updated 2 years ago
- Maintain a FAISS index for specified Datasette tables ☆36 · Updated 11 months ago
- Sentence Embedding as a Service ☆15 · Updated last year
- Vector functions and indexing for SQLite ☆11 · Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 · Updated last year
- Efficient BM25 with DuckDB 🦆 ☆48 · Updated 4 months ago
- History of Open-Source IR Systems ☆11 · Updated 3 months ago
- Application configuration and scripts for search on https://docs.vespa.ai/ ☆12 · Updated 2 weeks ago
- ☆31 · Updated 2 years ago
- Distributed Approximate Nearest Neighbors Database https://anndb.com ☆36 · Updated 4 years ago
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆35 · Updated 2 weeks ago
- utilities for loading and running text embeddings with onnx ☆44 · Updated 9 months ago
- NetworkX-like Python experience for Postgres, SQLite, MongoDB, and Neo4J ☆23 · Updated 2 months ago
- A python library to generate highly realistic typos (fuzz-testing) ☆11 · Updated 2 months ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ☆77 · Updated 2 years ago
- Implementation of Saved Searches à la Elasticsearch Percolator ☆12 · Updated 2 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA ☆36 · Updated last year
- Rust bindings for CTranslate2 ☆14 · Updated last year