agora-team / elasticsearch-synonyms
Curated synonym files and helpers for the Elasticsearch synonym token filter
☆63 · Updated last year
Related projects:
- email dataset for email signature parsing ☆52 · Updated 8 years ago
- For extracting measurements and related entities from text ☆56 · Updated 4 years ago
- Lucene Auto Phrase TokenFilter implementation ☆59 · Updated 6 years ago
- Raw Wikipedia counts for entity linking ☆19 · Updated 7 years ago
- Extract postal addresses from the DOM ☆65 · Updated 12 years ago
- A simple algorithm for clustering web pages, suitable for crawlers ☆34 · Updated 7 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says ☆154 · Updated 3 years ago
- Hunspell extension for spaCy 2.0. ☆94 · Updated last month
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H… ☆28 · Updated 5 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data. ☆49 · Updated 7 years ago
- Tools and data for creating DBpedia Spotlight models. ☆37 · Updated 2 years ago
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework ☆166 · Updated 2 years ago
- ☆21 · Updated 6 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆173 · Updated 9 months ago
- Text classification using Naive Bayes and Elasticsearch ☆154 · Updated 8 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆32 · Updated 2 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆80 · Updated 6 years ago
- Language detection extension for spaCy 2.0+ ☆111 · Updated 5 years ago
- Python wrapper for Apache OpenNLP tools ☆34 · Updated 7 years ago
- a Deep Learning based Speller ☆27 · Updated 5 years ago
- A natural language search microservice ☆95 · Updated 3 years ago
- A Python library to detect and extract listing data from HTML pages. ☆109 · Updated 7 years ago
- LanguageCrunch NLP server docker image ☆287 · Updated last year
- This is a REST Server endpoint built using Flask and Python. ☆23 · Updated last year
- Relatively simple text classification powered by spaCy ☆42 · Updated 8 years ago
- Entity resolution for Elasticsearch. ☆156 · Updated last month
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm ☆40 · Updated 7 years ago
- Wikipedia-based keyword extraction tool in Java ☆21 · Updated 9 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆65 · Updated last year
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking ☆135 · Updated 6 months ago