softwaredoug / np-sims
NumPy ufuncs for vector similarity
☆14 · Updated last year
Alternatives and similar repositories for np-sims:
Users who are interested in np-sims are comparing it to the libraries listed below.
- Neural Solr = Solr 9 + Mighty Inference + Node ☆16 · Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Updated 2 years ago
- Cortex-compatible model server for Python and TensorFlow ☆17 · Updated 2 years ago
- Sentence Embedding as a Service ☆15 · Updated last year
- A fork of llama3.c used to do some R&D on inferencing ☆19 · Updated 2 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- Efficient BM25 with DuckDB 🦆 ☆39 · Updated 2 months ago
- Graph Engine for Exploration and Search ☆40 · Updated last year
- Documentation effort for the BookCorpus dataset ☆33 · Updated 3 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 · Updated 3 years ago
- Testing various image matching algorithms' performance on the Pinecone vector DB ☆43 · Updated last year
- Locality Sensitive Hashing ☆71 · Updated last year
- ☆63 · Updated 2 months ago
- Rust bindings for CTranslate2 ☆14 · Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆54 · Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA ☆35 · Updated last year
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆33 · Updated 6 months ago
- Detecting gibberish as a type of sentiment analysis with GPT2 ☆23 · Updated 4 years ago
- Creating Generative AI Apps which work ☆16 · Updated 7 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module ☆49 · Updated last month
- Embedding models from Jina AI ☆58 · Updated last year
- utilities for loading and running text embeddings with onnx ☆44 · Updated 6 months ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental) ☆43 · Updated 5 years ago
- Efficiently computing & storing token n-grams from large corpora ☆18 · Updated 4 months ago
- Vector functions and indexing for SQLite ☆11 · Updated last year
- Search for similar short strings ☆52 · Updated 4 years ago
- Vespa application making an index of the CORD-19 dataset. ☆39 · Updated last month
- A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQL ☆29 · Updated 2 years ago