cloudboxlabs / blog-codeLinks
Cloudbox Labs blog code
☆35Updated 7 years ago
Alternatives and similar repositories for blog-code
Users that are interested in blog-code are comparing it to the libraries listed below
Sorting:
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆160Updated 2 weeks ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Updated last year
- Fully reproducible, Dockerized, step-by-step, demo on how to stream tables from Postgres to Kafka/KSQL back to Postgres. Detailed blog p…☆152Updated 4 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆70Updated last year
- Mirror of Apache Beam☆10Updated 5 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated 2 years ago
- Cloudformation templates for deploying Airflow in ECS☆40Updated 7 years ago
- These are some code examples☆56Updated 6 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 6 years ago
- Kafka Streams Example - Joining streams to generate rich clickstream analysis data☆39Updated 3 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker…☆84Updated 3 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Updated 2 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆152Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 8 months ago
- Get started with Apache Beam and Flink☆43Updated 9 years ago
- Experiments and demonstrations of AVRO, Protobuf serialisation☆61Updated 3 years ago
- Presentations and other resources.☆36Updated 5 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆97Updated 6 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Updated 2 years ago
- Some AWS EMR examples☆16Updated 8 years ago
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 5 months ago
- AWS Lambda function to get events in Kafka topic when files are uploaded to S3☆24Updated 7 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 5 years ago
- Airflow workflow management platform chef cookbook.☆70Updated 6 years ago