theShadow89 / nifi-bigquery-bundle
BigQuery bundle for Apache NiFi
☆15 Updated 6 years ago
Alternatives and similar repositories for nifi-bigquery-bundle
Users who are interested in nifi-bigquery-bundle are comparing it to the libraries listed below.
- Spark data source for Salesforce ☆80 Updated last year
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆90 Updated last year
- Google BigQuery support for Spark, SQL, and DataFrames ☆155 Updated 5 years ago
- Examples for High Performance Spark ☆16 Updated last month
- Kinesis Connector for Structured Streaming ☆137 Updated last year
- ☆70 Updated 4 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive ☆19 Updated 8 months ago
- Spark package for checking data quality ☆221 Updated 5 years ago
- XML Serializer/Deserializer for Apache Hive ☆41 Updated 6 years ago
- Data ingestion library for Amundsen to build graph and search index ☆205 Updated last year
- Spark connector for SFTP ☆98 Updated 2 years ago
- A COBOL parser and Mainframe/EBCDIC data source for Apache Spark ☆154 Updated 2 weeks ago
- Loads Snowplow enriched events into Google BigQuery ☆22 Updated 6 months ago
- Snowflake Data Source for Apache Spark. ☆230 Updated last week
- Multiple node presto cluster on docker container ☆126 Updated 3 years ago
- JSON schema parser for Apache Spark ☆82 Updated 3 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 Updated 7 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 Updated 6 years ago
- Front-end service library for Amundsen ☆279 Updated 3 weeks ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 6 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 Updated last year
- ☆31 Updated 7 years ago
- Spark Structured Streaming State Tools ☆34 Updated 5 years ago
- Task Metrics Explorer ☆14 Updated 6 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a… ☆225 Updated 7 months ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆184 Updated 2 weeks ago
- HDF masterclass materials ☆29 Updated 9 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service. ☆72 Updated last year
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 Updated 2 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆70 Updated 2 months ago