danmalczyk / kdoc
Dockerized Kylo deployed as multiple layers and services for fast redeploys from source.
☆13Updated 7 years ago
Alternatives and similar repositories for kdoc:
Users who are interested in kdoc are comparing it to the libraries listed below.
- ☆18Updated 7 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- Schema Registry☆16Updated 10 months ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository.☆70Updated 4 years ago
- CDAP Kubernetes Operator☆19Updated last week
- Apache NiFi cluster running in Kubernetes☆84Updated 5 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Updated last year
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Updated last year
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆71Updated last year
- Apache DataLab (incubating)☆153Updated last year
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated 2 months ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated last year
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 2 weeks ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆75Updated 6 years ago
- SQL for Kafka Connectors☆98Updated last year
- Apache Drill Dialect for SQL Alchemy☆54Updated 2 months ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 4 years ago
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc…☆85Updated last year
- Terraform provider for interacting with NiFi cluster☆51Updated 5 years ago
- Apache NiFi Registry☆109Updated 3 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 6 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs☆26Updated 4 months ago
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆156Updated last week
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆64Updated last year
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- compatibility tests to make sure C and Java implementations can read each other☆68Updated 3 years ago