BitwiseInc / HydrographLinks
A visual ETL development and debugging tool for big data
☆154Updated 3 years ago
Alternatives and similar repositories for Hydrograph
Users that are interested in Hydrograph are comparing it to the libraries listed below
Sorting:
- StreamLine - Streaming Analytics☆164Updated 2 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 10 years ago
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 9 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆162Updated 3 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- An adapter for Oracle GoldenGate to push change capture data directly to an Apache Kafka cluster☆53Updated 10 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆47Updated 6 years ago
- The SpliceSQL Engine☆170Updated 2 years ago
- Mirror of Apache Apex malhar☆133Updated 6 years ago
- DataQuality for BigData☆145Updated 2 years ago
- Apache MiNiFi (a subproject of Apache NiFi)☆125Updated 4 years ago
- ☆107Updated 3 years ago
- Ambari service for Apache Zeppelin notebook☆71Updated 8 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 6 months ago
- ☆50Updated 5 years ago
- Apache DataLab (incubating)☆153Updated 2 years ago
- Apache NiFi example flows☆210Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.☆175Updated 2 years ago
- A proof of concept using Divolte, Kafka, Druid and Superset☆62Updated 5 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Updated 7 years ago
- CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and a…☆361Updated last week
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆280Updated 6 years ago
- Example of running MDX on Druid via Mondrian and Calcite☆26Updated 9 years ago