FurkanToprak / OkapiBM25
Well-tested implementation of the OkapiBM25 algorithm. Install the npm package. Now at 21K downloads!
☆18Updated 5 months ago
Alternatives and similar repositories for OkapiBM25:
Users that are interested in OkapiBM25 are comparing it to the libraries listed below
- Multilingual tokenizer that automatically tags each token with its type☆61Updated 2 years ago
- Fast Full Text Search based on BM25☆60Updated 2 years ago
- Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to s…☆33Updated 6 years ago
- ☆70Updated 2 years ago
- In browser active learning and guided search☆17Updated last year
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆23Updated last year
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Dead simple cron service for making HTTP calls on a regular schedule.☆14Updated 4 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Fuzzy Categorical Distances☆14Updated 4 years ago
- Nodejs binding for fasttext representation and classification.☆43Updated last year
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 5 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- A highly configurable and dynamic rules engine based on JSON Schema☆49Updated last year
- email dataset for email signature parsing☆55Updated 8 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆26Updated last year
- Example how to pre-process news articles with textbox and index on Elastic Search☆13Updated 7 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees.☆81Updated 4 years ago
- A lightweight JavaScript client library for the Wikimedia Pageviews API for Wikipedia and various of its sister projects for Node.js and …☆27Updated 4 years ago
- Multi-Langauge Identification☆29Updated 7 months ago
- ☆9Updated 4 years ago
- Accurate and fast sentiment scoring of phrases with #hashtags, emoticons :) & emojis 🎉☆62Updated 2 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- A React component to make correcting automated transcriptions of audio and video easier and faster. Using the SlateJs editor.☆80Updated 2 years ago
- A example survey app built with Lowdefy.☆17Updated 2 years ago
- Using embeddings compressed by Product Quantization, in Javascript☆31Updated last year
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date☆41Updated 4 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago