andreybratus/RefineOnSpark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/andreybratus/RefineOnSpark)

andreybratus / RefineOnSpark

☆34

Alternatives and similar repositories for RefineOnSpark

Users that are interested in RefineOnSpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fusepoolP3 / p3-batchrefine
View on GitHub
BatchRefine adds batch processing capabilities to OpenRefine
☆50Dec 14, 2016Updated 9 years ago
subhh / HOS-MetadataTransformations
View on GitHub
DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…
☆19Apr 3, 2020Updated 6 years ago
uwegeercken / nifi_processors
View on GitHub
Java code for Apache Nifi processors
☆11Jun 5, 2017Updated 9 years ago
ror-community / ror-reconciler
View on GitHub
OpenRefine reconciler for Research Organization Registry
☆13Feb 13, 2026Updated 5 months ago
davies / tpcds-kit
View on GitHub
TPC-DS benchmark kit with some modifications/additions
☆10Nov 12, 2015Updated 10 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fusepoolP3 / p3-platform-reference-implementation
View on GitHub
Fusepool P3 Platform Reference Implementation
☆13Apr 7, 2016Updated 10 years ago
shivajid / HortonworksOperationsWorkshop
View on GitHub
☆14Oct 14, 2015Updated 10 years ago
prateek / nifi-parcel
View on GitHub
☆23Apr 4, 2018Updated 8 years ago
cmharlow / GetUrRecon
View on GitHub
All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.
☆24Feb 5, 2022Updated 4 years ago
RBGKew / Reconciliation-and-Matching-Framework
View on GitHub
A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne…
☆34Apr 18, 2017Updated 9 years ago
williamjturkel / Digital-History-Hacks--2005-08-
View on GitHub
Source code repository for Digital History Hacks
☆25Jun 16, 2013Updated 13 years ago
zladovan / gradle-avrohugger-plugin
View on GitHub
Gradle plugin for generating scala case classes from apache avro schemas, datafiles and protocols
☆12May 11, 2025Updated last year
dsrkoc / groovy-sql-stream-extension
View on GitHub
Efficient Groovy data set processing using common collection methods such as `collect`, `findAll`, etc.
☆17Jun 2, 2015Updated 11 years ago
siia-fisd / altdata-council
View on GitHub
best practices and standards for the delivery of alternative data to the investment industry
☆11Apr 21, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
InfoSeeking / Socrates
View on GitHub
A platform for collecting, analyzing, and visualizing social media data.
☆13Dec 27, 2020Updated 5 years ago
asdaraujo / filecrush
View on GitHub
Remedy small files by combining them into larger ones.
☆23Oct 31, 2018Updated 7 years ago
data-describe / awesome-data-science-models
View on GitHub
A few end to end examples that use data-describe
☆17May 2, 2023Updated 3 years ago
mets / METS-schema
View on GitHub
METS 1.x and METS 2 schemas
☆26May 28, 2025Updated last year
pvillard31 / nifi-gcp-terraform
View on GitHub
Terraform / NiFi on the Google Cloud Platform
☆29Nov 12, 2024Updated last year
bigdataprotocol / bdp-contracts
View on GitHub
☆11Mar 4, 2021Updated 5 years ago
contribution-jhipster-uga / generator-jhipster-stripe-payment
View on GitHub
JHipster module, this module integrate the payment plateform STRIPE to a Jhipster project. (It includes a web payment page and a JHipster…
☆12Dec 8, 2022Updated 3 years ago
smoqadam / add-to-feedly
View on GitHub
a firefox extension to add a website to feedly ;)
☆10Nov 22, 2017Updated 8 years ago
projectblacklight / blacklight-maps
View on GitHub
Map search results view for Blacklight
☆14Feb 21, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
stkenny / grefine-rdf-extension
View on GitHub
An extension to OpenRefine that enables graphical mapping of OpenRefine project data to an RDF skeleton and then exporting it in RDF form…
☆81Jun 28, 2025Updated last year
alantygel / ckanext-tagmanager
View on GitHub
Tag management for CKAN
☆10Aug 20, 2024Updated last year
pm5 / node-openrefine
View on GitHub
OpenRefine client in Node.js
☆16Mar 16, 2022Updated 4 years ago
TheRockStarDBA / awesome-public-datasets
View on GitHub
An awesome list of high-quality open datasets in public domains (on-going).
☆11Nov 20, 2015Updated 10 years ago
jkubrynski / profiling
View on GitHub
☆11Oct 12, 2013Updated 12 years ago
open511 / Open511API
View on GitHub
Code for open511.org
☆12Jan 20, 2021Updated 5 years ago
la-team / lightadmin-jhipster
View on GitHub
LightAdmin and JHipster integration example
☆18Dec 17, 2023Updated 2 years ago
unitedstates / petitions
View on GitHub
White House petition crawler.
☆15Mar 6, 2013Updated 13 years ago
Data-Nutrition-Project / dnp-website
View on GitHub
☆15Oct 16, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ucbrise / jarvis
View on GitHub
Build, configure, and track workflows with Jarvis.
☆14Apr 17, 2018Updated 8 years ago
baffelli / covid-2019-measures
View on GitHub
☆11Apr 15, 2020Updated 6 years ago
betatim / openrefineder
View on GitHub
💠 + 📚 OpenRefine on Binder!
☆41Jun 11, 2024Updated 2 years ago
samuelmeuli / python-wikibase
View on GitHub
🤖 Wikibase queries and edits made easy
☆11Feb 9, 2020Updated 6 years ago
CrossRef / jats-crossref-xslt
View on GitHub
JATS to CrossRef deposit XML translation via an XSLT
☆12Sep 27, 2019Updated 6 years ago
lcnetdev / PREMIS
View on GitHub
☆10Apr 26, 2026Updated 2 months ago
elliewix / LIS452-Spring2017Lectures
View on GitHub
☆10Jun 16, 2017Updated 9 years ago