ml-for-nlp / build-your-search-engine
A repository to learn basic data processing techniques (Wikipedia processing, feature selection), and use them for some basic Web query classification.
☆27Updated 3 years ago
Alternatives and similar repositories for build-your-search-engine
Users who are interested in build-your-search-engine are comparing it to the repositories listed below
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆25Updated 2 years ago
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆87Updated last year
- Document Search Engine project with TF-IDF and Google Universal Sentence Encoder model☆55Updated 2 years ago
- Document Search Engine Tool☆76Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 3 years ago
- SLING - A natural language frame semantics parser☆172Updated this week
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated 11 months ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆35Updated 2 years ago
- Tools to construct and process Common Crawl webgraphs☆103Updated 3 weeks ago
- Graph databases, Knowledge Graphs, SPARQ☆82Updated 4 years ago
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models.☆16Updated 4 years ago
- Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of e…☆196Updated 3 years ago
- ☆55Updated last week
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆47Updated 2 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆65Updated last week
- 🖍️ Highlight text in documents☆111Updated 8 months ago
- 🏖TagEditor - Annotation tool for spaCy☆193Updated 3 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 5 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 5 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆126Updated last year
- Fast and robust date extraction from web pages, with Python or on the command-line☆143Updated 2 months ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas☆59Updated 3 months ago
- Applying BERT for named entity recognition on resumes.☆68Updated 2 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆59Updated 3 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 7 months ago
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.☆92Updated last year
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆27Updated 2 years ago