streamsets / pipeline-library
Pipeline library for StreamSets Data Collector and Transformer
☆33 Updated 2 years ago
Alternatives and similar repositories for pipeline-library
Users who are interested in pipeline-library are comparing it to the libraries listed below.
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- Cask Hydrator Plugins Repository ☆68 Updated 2 weeks ago
- A Flink application that demonstrates reading and writing to/from Apache Kafka with Apache Flink ☆20 Updated last year
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple… ☆26 Updated 4 years ago
- ☆49 Updated 5 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 Updated 4 years ago
- Yet Another (Spark) ETL Framework ☆21 Updated last year
- Infrastructure automation to deploy Hadoop, Hive, Spark, Airflow nodes on a docker host ☆20 Updated 6 years ago
- ☆30 Updated 2 weeks ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi ☆15 Updated 2 years ago
- ☆14 Updated 3 weeks ago
- Collection of examples integrating NiFi with stream process frameworks. ☆59 Updated 8 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 Updated 6 years ago
- Stocks -> NiFi -> Kafka -> Profit ☆14 Updated 6 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆70 Updated 2 years ago
- ☆26 Updated 4 years ago
- Apache Nifi Examples by http://www.nifi.rocks ☆38 Updated 6 years ago
- HDF masterclass materials ☆28 Updated 9 years ago
- spark-drools tutorials ☆16 Updated last year
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- ☆27 Updated last year
- Examples of Spark 3.0 ☆47 Updated 4 years ago
- Ambari View for the Ambari Store ☆15 Updated 9 years ago
- Ecosystem website for Apache Flink ☆12 Updated last year
- data-mesh-demo ☆13 Updated 3 years ago
- A complete custom processor project, for your reference. ☆18 Updated 9 years ago
- Delta Lake Examples ☆12 Updated 5 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 Updated 6 years ago
- Mastering Spark for Data Science, published by Packt ☆47 Updated 2 years ago