FurkanToprak / OkapiBM25
Well-tested implementation of the OkapiBM25 algorithm. Install the npm package!
☆17 · Updated 4 months ago
Alternatives and similar repositories for OkapiBM25:
Users who are interested in OkapiBM25 are comparing it to the libraries listed below.
- Fast Full Text Search based on BM25 ☆60 · Updated 2 years ago
- Multilingual tokenizer that automatically tags each token with its type ☆61 · Updated last year
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more. ☆23 · Updated last year
- Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to s… ☆33 · Updated 6 years ago
- Simple end-to-end encryption for webapps ☆15 · Updated last week
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js ☆26 · Updated last year
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments ☆45 · Updated last month
- Fuzzy Categorical Distances ☆14 · Updated 4 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. ☆14 · Updated 4 years ago
- Node starter kit for semantic-search. Uses Mighty Inference Server with Qdrant vector search. ☆15 · Updated last year
- FalkorDB Python Client ☆13 · Updated this week
- A general-purpose lightweight sandbox for safely executing user programs ☆16 · Updated 4 years ago
- Fast Metaphone implementation ☆48 · Updated 2 years ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g… ☆14 · Updated last year
- Using embeddings compressed by Product Quantization, in JavaScript ☆31 · Updated last year
- JSON parser for streaming objects live from an LLM's output ☆29 · Updated 11 months ago
- An index data structure for approximate string search. ☆23 · Updated 5 years ago
- A pure JS implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm. ☆34 · Updated 6 years ago
- Fast lossless JSON parse event streaming, in JavaScript. ☆36 · Updated 4 months ago
- rerank library for easy reranking of results ☆34 · Updated 4 months ago
- JavaScript library to rapidly annotate paragraphs of text on websites with the goal of making the annotation process as fast and simple a… ☆31 · Updated 7 years ago
- A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list. ☆20 · Updated 8 years ago
- spaCy on the web ☆45 · Updated last year
- Parallel wasm Barnes-Hut t-SNE implementation written in Rust. ☆16 · Updated 7 months ago
- Simple structured data from any LLM ☆20 · Updated this week
- ☆21 · Updated 3 months ago
- Datasette plugin for streaming SQLite database backups to S3, using Litestream! ☆14 · Updated last year
- Quickly estimate the similarity between many sets ☆51 · Updated 2 years ago
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆29 · Updated 5 months ago
- Application configuration and scripts for search on https://docs.vespa.ai/ ☆13 · Updated 2 weeks ago