theShadow89 / nifi-bigquery-bundle
BigQuery bundle for Apache NiFi
☆15Updated 6 years ago
Alternatives and similar repositories for nifi-bigquery-bundle
Users who are interested in nifi-bigquery-bundle are comparing it to the libraries listed below.
- Examples for High Performance Spark☆16Updated 7 months ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 9 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- CDF Tech Bootcamp☆9Updated 5 years ago
- HDF masterclass materials☆28Updated 9 years ago
- Spark to Tableau Extractor library☆18Updated 7 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Updated 7 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 5 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 5 years ago
- A sample implementation of the Spark Datasource API☆24Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Apache Atlas development image for the Rokku project: https://github.com/ing-bank/rokku☆21Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated 2 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 6 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- Spark data source for Salesforce☆80Updated last year
- ☆71Updated 4 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- ☆48Updated 7 years ago
- Single view demo☆14Updated 9 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Updated 2 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Updated 9 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Spark cloud integration: tests, cloud committers and more☆19Updated 4 months ago
- Flink stream filtering examples☆19Updated 9 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago