pawelrychlik / duplitectorLinks
A duplicate data detector engine PoC based on Elasticsearch.
☆20Updated 10 years ago
Alternatives and similar repositories for duplitector
Users that are interested in duplitector are comparing it to the libraries listed below
Sorting:
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 9 years ago
- Google Drive river for Elasticsearch☆20Updated 11 years ago
- A command line and Python client for Open-Spending☆10Updated 8 years ago
- Python interface for OrientDB binary Serialization☆10Updated 5 years ago
- Customised UITextField and UITextView with HintLabel, ErrorLabel, Divider and validations☆10Updated 9 years ago
- ☆10Updated 6 years ago
- Term List Matching Plugin for ElasticSearch☆26Updated 11 years ago
- Baseform lemmatization for Elasticsearch☆26Updated 6 years ago
- MPC5744的UCOSII移植☆10Updated 6 years ago
- Verteego Data Suite☆10Updated 8 years ago
- Práctica del Workshop de NLP #NodeConfAR2017☆10Updated 8 years ago
- Hunspell analysis for ElasticSearch☆38Updated 13 years ago
- A tiny Java program that shows the current calendar week in a system tray☆11Updated 8 years ago
- ☆10Updated 7 years ago
- Web Design / July 2015 / Group 1 (Sundays & Tuesdays 18-21)☆10Updated 10 years ago
- ScaleGraph is an X10 billion scale graph analysis library.☆21Updated 9 years ago
- Web frontend for Myria☆12Updated 5 years ago
- ☆20Updated 8 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- .NET Client for Algorithmia Algorithms and Data API☆10Updated 6 years ago
- The GitHub repository for the Copenhagen Dependency Treebanks exported from Google Code. The repository is still in the process of being …☆11Updated 5 years ago
- A backend service for the Push-Android app to connect and pull data from.☆10Updated 2 years ago
- ☆10Updated 7 years ago
- Small hack to use list of attendees and startups at Web Summit 2015☆10Updated 10 years ago
- ☆10Updated 9 years ago
- this is cmy☆10Updated 6 years ago
- A stemmer for Slovak language☆12Updated 8 years ago
- Shave pages off of PDFs as images☆59Updated 7 years ago
- ☆10Updated 3 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago