softwaredoug / np-simsLinks
numpy ufuncs for vector similarity
☆14Updated 2 years ago
Alternatives and similar repositories for np-sims
Users that are interested in np-sims are comparing it to the libraries listed below
Sorting:
- hnsw implemented by python☆21Updated 6 years ago
- The pipeline for the OSCAR corpus☆176Updated 3 months ago
- Full text search that feels like a numpy array☆301Updated last week
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- Library for fast text representation and classification.☆31Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆160Updated 2 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆198Updated 2 weeks ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- Neural Search☆334Updated last year
- Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.☆257Updated last week
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 7 months ago
- Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU☆166Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆60Updated 3 months ago
- Fast Text Classification with Compressors dictionary☆150Updated 2 years ago
- A robust web archive analytics toolkit☆129Updated 3 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Extracts plain text, language identification and more metadata from WARC records☆23Updated 4 months ago
- SLING - A natural language frame semantics parser☆174Updated last week
- Grammar Induction using a Template Tree Approach☆47Updated 9 months ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- Vespa application making an index of the CORD-19 dataset.☆40Updated 7 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆245Updated 2 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆261Updated 4 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- A library for building and serving multi-node distributed faiss indices.☆276Updated 2 years ago
- ⚡ A fast embedded library for approximate nearest neighbor search☆236Updated 2 years ago
- SimString☆113Updated 4 years ago