theShadow89 / nifi-bigquery-bundle
Bigquery bundle for Apache NiFi
☆15Updated 5 years ago
Alternatives and similar repositories for nifi-bigquery-bundle:
Users that are interested in nifi-bigquery-bundle are comparing it to the libraries listed below
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- These are some code examples☆55Updated 5 years ago
- Examples for High Performance Spark☆15Updated 4 months ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated last month
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 4 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- Single view demo☆14Updated 9 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- HDF masterclass materials☆28Updated 8 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆75Updated 11 months ago
- Random implementation notes☆33Updated 11 years ago
- A facebook for data☆26Updated 5 years ago
- Flink stream filtering examples☆19Updated 8 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated last year
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆58Updated 8 years ago
- The iterative broadcast join example code.☆69Updated 7 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 5 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Updated 5 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- An Apache access log parser written in Scala☆72Updated 4 years ago