pawelrychlik / duplitectorLinks
A duplicate data detector engine PoC based on Elasticsearch.
☆20Updated 10 years ago
Alternatives and similar repositories for duplitector
Users that are interested in duplitector are comparing it to the libraries listed below
Sorting:
- Google Drive river for Elasticsearch☆20Updated 10 years ago
- This is where you can find all the code you need for the ELUNA 2019 Developers Day+ Alma Workshop.☆10Updated 6 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 9 years ago
- A command line and Python client for Open-Spending☆10Updated 7 years ago
- Analyze standard numbers like ARK, DOI, EAN, GTIN, IBAN, ISAN, ISBN, ISMN, ISNI, ISSN, ISTC, ISWC, ORCID, PPN, SICI, UPC, ZDB with Elasti…☆24Updated 9 years ago
- Python interface for OrientDB binary Serialization☆10Updated 5 years ago
- This is an aggregation of technical resources for those who are new to coding and would like to learn the basic concepts.☆10Updated 7 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Updated 9 years ago
- Web Design / July 2015 / Group 1 (Sundays & Tuesdays 18-21)☆10Updated 10 years ago
- MPC5744的UCOSII移植☆10Updated 6 years ago
- Baseform lemmatization for Elasticsearch☆26Updated 6 years ago
- Curiosity is a generic frontend for facetting, displaying and editing data from any elasticsearch index.☆75Updated 9 years ago
- ☆10Updated 3 years ago
- Customised UITextField and UITextView with HintLabel, ErrorLabel, Divider and validations☆10Updated 9 years ago
- A backend service for the Push-Android app to connect and pull data from.☆10Updated 2 years ago
- Small hack to use list of attendees and startups at Web Summit 2015☆10Updated 10 years ago
- Práctica del Workshop de NLP #NodeConfAR2017☆10Updated 8 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago
- Easy answers to citizens questions☆61Updated 9 years ago
- Python natural language processing work☆29Updated 16 years ago
- ☆20Updated 8 years ago
- ☆10Updated 6 years ago
- A metadistribution of RDF.rb including all parsing/serialization plugins.☆52Updated 5 months ago
- 😂 Predicts an emoji based on sentiment & semantic analyses for a test dataset based on a training dataset of tweets.☆10Updated 8 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 9 years ago
- Hunspell analysis for ElasticSearch☆38Updated 13 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 11 years ago
- ☆10Updated 7 years ago
- A bundle of useful Elasticsearch plugins☆112Updated last year