pawelrychlik / duplitector
A duplicate data detector engine PoC based on Elasticsearch.
☆20Updated 10 years ago
Alternatives and similar repositories for duplitector:
Users that are interested in duplitector are comparing it to the libraries listed below
- A command line and Python client for Open-Spending☆10Updated 7 years ago
- a pure javascript frontend for ElasticSearch search indices.☆79Updated 7 years ago
- an aggregation function for PostgreSQL☆40Updated 3 months ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 9 years ago
- **el.vis** - a tool for visualising public (EU) tenders big data☆8Updated last year
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆102Updated last month
- CSV grooming, the JS way☆21Updated 5 years ago
- To promote exploration and use of open data - currently in beta☆14Updated 7 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Updated 2 years ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables☆88Updated 2 years ago
- Demonstration of how dedupe might be used as geocoder☆17Updated 2 years ago
- An elasticsearch plugin to create hierarchical aggregations☆51Updated last week
- Load CSV files into Postgres without explicit schema creation.☆81Updated 3 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Updated 11 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- Crowdsourcing platform for gathering gender information about politicians to improve the data in EveryPolitician☆15Updated 4 years ago
- Reusable sparkline chart in D3.js, similar to mashable's 'velocity graph'☆56Updated 7 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 9 years ago
- A bundle of useful Elasticsearch plugins☆110Updated 11 months ago
- elasticsearch schema files and tooling☆40Updated last month
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆155Updated 3 years ago
- Test using WebWorkers to run D3 geo projection☆10Updated 6 years ago
- ***DEPRECATED: No longer being built*** A Radar (or Spider) Chart plugin for Kibi 0.3.x+ or Kibana 4.3.x+ free as in beer and speech, enj…☆35Updated 4 years ago
- Creates a static API from a CSV file☆29Updated 8 years ago
- Bash script to install Nominatim☆68Updated 8 years ago
- [DEPRECATED] Please use https://goodtables.io☆13Updated 8 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- JSON schemas for OpenCorporates data☆20Updated 10 months ago
- International legislative data specifications☆100Updated 2 years ago