ScJa / document-search-engine
A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.
☆16Updated 7 years ago
Alternatives and similar repositories for document-search-engine
Users that are interested in document-search-engine are comparing it to the libraries listed below
Sorting:
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆54Updated 2 years ago
- An app that extracts your twitter threads into a downloadable CSV file.☆10Updated 2 years ago
- Model for predicting categories of entities by its mentions☆29Updated 3 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆20Updated 3 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆72Updated 10 months ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Use Kore.ai's Knowledge Graph Generator to automatically extract terms from FAQs, define the hierarchy between these terms, and also asso…☆15Updated last year
- An easy to use framework for large-scale fact-checking and question answering☆69Updated last year
- A python module to process data for Frame Semantic Parsing☆24Updated 4 years ago
- sequence tagging with spaCy and crfsuite☆19Updated 2 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆85Updated 4 years ago
- simple rule based named entity recognition☆43Updated 3 years ago
- Keyword extraction with spaCy☆31Updated 3 years ago
- ☆18Updated 2 years ago
- Extracting narrative timelines (i.e. order and timing of events) from text☆20Updated 6 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Updated 2 years ago
- Tool for disambiguating acronyms and abbreviations in text for NLP applications☆22Updated 11 months ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Text similarity using BERT sentence embeddings☆20Updated 5 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using …☆17Updated 4 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Named entity recognition for the legal domain☆42Updated 3 years ago
- A simple library for training named entity recognition model from partially annotated data☆23Updated last year
- Open-source, knowledge-grounded conversational AI☆13Updated 5 months ago
- ☆16Updated 4 years ago
- Visualizing ELMo Contextual Vectors for Word Sense Disambiguation☆15Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago