ScJa / document-search-engine
A really fast document ranking engine using BM25 and TF-IDF. Written in Python, using the NLP packages NLTK and spaCy.
☆15 Updated 6 years ago
Alternatives and similar repositories for document-search-engine:
Users who are interested in document-search-engine are comparing it to the libraries listed below.
- Document Search Engine project with TF-IDF and Google Universal Sentence Encoder model ☆53 Updated last year
- Document Search Engine Tool ☆72 Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles. ☆68 Updated 7 months ago
- ☆64 Updated last year
- A simple Flask & React app to demonstrate how to generate text with OpenAI's GPT-2 ☆52 Updated 2 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using … ☆17 Updated 3 years ago
- Use Kore.ai's Knowledge Graph Generator to automatically extract terms from FAQs, define the hierarchy between these terms, and also asso… ☆15 Updated last year
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words. ☆33 Updated last year
- An app that extracts your twitter threads into a downloadable CSV file. ☆10 Updated last year
- Extracting narrative timelines (i.e. order and timing of events) from text ☆20 Updated 5 years ago
- A simple library for training named entity recognition model from partially annotated data ☆22 Updated last year
- semantically distinct key phrase extraction using hilbert hashes. ☆48 Updated 2 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆47 Updated 6 months ago
- ☆69 Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 2 years ago
- ☆28 Updated 4 years ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically. ☆40 Updated last year
- Sample datasets of over 400 Instagram coding influencers ☆11 Updated last month
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset ☆21 Updated 2 years ago
- Neural Elastic Inference and Search ☆19 Updated 5 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆46 Updated 5 months ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 Updated 3 years ago
- sequence tagging with spaCy and crfsuite ☆18 Updated last year
- simple rule based named entity recognition ☆43 Updated 2 years ago
- Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkeley constituenc… ☆33 Updated 4 years ago
- Building a bot to handle general tasks for insurance. ☆23 Updated last year
- This is the frontend layer of SearchX. SearchX is a scalable collaborative search system being developed by Lambda Lab of TU Delft. ☆14 Updated last year
- 🚀GUI for training spaCy models ☆54 Updated 3 years ago
- Graph databases, Knowledge Graphs, SPARQ ☆76 Updated 3 years ago