☆34Sep 22, 2014Updated 11 years ago
Alternatives and similar repositories for RefineOnSpark
Users that are interested in RefineOnSpark are comparing it to the libraries listed below.
- BatchRefine adds batch processing capabilities to OpenRefine☆50Dec 14, 2016Updated 9 years ago
- Edit-distance-based similar string joiner and clusterer☆18Jul 2, 2015Updated 10 years ago
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…☆19Apr 3, 2020Updated 6 years ago
- Source code repository for Digital History Hacks☆24Jun 16, 2013Updated 12 years ago
- Java code for Apache NiFi processors☆11Jun 5, 2017Updated 9 years ago
- An expansive bundle of NiFi additions intended to be used for generating test data☆11Aug 6, 2023Updated 2 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆13May 3, 2019Updated 7 years ago
- Sample custom NiFi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Fusepool P3 Platform Reference Implementation☆13Apr 7, 2016Updated 10 years ago
- Dremio Metabase driver☆17Apr 7, 2020Updated 6 years ago
- ☆14Oct 14, 2015Updated 10 years ago
- Turning human business logic into clear, verifiable instructions that enable LLMs to deliver stable, predictable, and testable AI logic w…☆25May 14, 2026Updated last month
- ☆23Apr 4, 2018Updated 8 years ago
- This repository has migrated to:☆100Oct 11, 2025Updated 8 months ago
- Presentation and materials for an IPython Notebook talk☆26Aug 22, 2014Updated 11 years ago
- A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne…☆34Apr 18, 2017Updated 9 years ago
- Gradle plugin for generating Scala case classes from Apache Avro schemas, datafiles and protocols☆12May 11, 2025Updated last year
- ☆15Jun 24, 2015Updated 10 years ago
- Remedy small files by combining them into larger ones.☆23Oct 31, 2018Updated 7 years ago
- Get files from ckan into the webstore.☆22Jan 6, 2022Updated 4 years ago
- The OpenRefine Python Client Library provides an interface for communicating with an OpenRefine server.☆180Aug 20, 2019Updated 6 years ago
- An awesome list of high-quality open datasets in public domains (on-going).☆10Nov 20, 2015Updated 10 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- A simple pre-made personal website with blogging and social integrations☆38Nov 12, 2023Updated 2 years ago
- Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel☆29Jun 10, 2021Updated 5 years ago
- ☆11Mar 4, 2021Updated 5 years ago
- An extension to OpenRefine that enables graphical mapping of OpenRefine project data to an RDF skeleton and then exporting it in RDF form…☆81Jun 28, 2025Updated 11 months ago
- Scala Mison implementation☆15Nov 16, 2018Updated 7 years ago
- Terraform / NiFi on the Google Cloud Platform☆29Nov 12, 2024Updated last year
- This is the source code for the AutCar project - Build your own self-driving toy car☆10Oct 1, 2020Updated 5 years ago
- Code for open511.org☆12Jan 20, 2021Updated 5 years ago
- A web-based tool to monitor how your website content is used in Wikipedia☆37Oct 22, 2020Updated 5 years ago
- Python code for Coursera Neural Networks class taught by Professor Geoffrey Hinton☆23Dec 17, 2012Updated 13 years ago
- A graph-based knowledge search engine powered by Wikipedia☆15May 1, 2023Updated 3 years ago
- Wrapper for TransparencyData.com API☆23Feb 14, 2014Updated 12 years ago
- Original GOKb repo - Moving to https://github.com/openlibraryenvironment/gokb☆11Jan 23, 2018Updated 8 years ago
- R Shiny App created to predict the success rate of Freedom of Information Act requests.☆16Dec 11, 2017Updated 8 years ago
- LightAdmin and JHipster integration example☆18Dec 17, 2023Updated 2 years ago
- 💠 + 📚 OpenRefine on Binder!☆41Jun 11, 2024Updated 2 years ago