cloudboxlabs / blog-code
Cloudbox Labs blog code
☆35 Updated 7 years ago
Alternatives and similar repositories for blog-code
Users who are interested in blog-code are comparing it to the libraries listed below
- Spark Scala docker container sample for AWS testing - EKS & S3 ☆24 Updated 7 years ago
- Kafka Connect connector to stream data in real time from Twitter. ☆127 Updated 3 years ago
- Presentations and other resources. ☆36 Updated 5 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x ☆41 Updated 6 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 Updated last year
- Ansible playbooks for Apache Spark on kube ☆27 Updated 8 years ago
- A guide to running Airflow on Kubernetes ☆174 Updated 6 years ago
- ☆81 Updated 2 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline ☆76 Updated 2 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker… ☆84 Updated 3 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service. ☆70 Updated last year
- kubernetes series code ☆174 Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆175 Updated 7 months ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance ☆24 Updated 7 years ago
- Example blueprint application for processing high-speed trading data. ☆85 Updated 2 years ago
- An example Apache Beam project. ☆111 Updated 8 years ago
- Materials (slides and code) for Kafka and Kafka Streams Workshops ☆62 Updated last year
- ☆66 Updated last year
- Example Kubernetes app that shows how to build a 'pipeline' to stream data into BigQuery. Uses Redis or Google Cloud PubSub ☆131 Updated 5 years ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab ☆63 Updated 6 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆77 Updated 7 years ago
- Get started with Apache Beam and Flink ☆43 Updated 9 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆70 Updated 4 months ago
- Experiments and demonstrations of AVRO, Protobuf serialisation ☆61 Updated 3 years ago
- Repository for streaming and batch samples of timeseries data ☆26 Updated 4 years ago
- Mirror of Apache Beam ☆10 Updated 4 years ago
- Dremio Container Tools ☆164 Updated 4 months ago
- Airflow workflow management platform chef cookbook. ☆70 Updated 6 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆152 Updated 9 years ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS) ☆53 Updated 5 years ago