cloudboxlabs / blog-codeLinks
Cloudbox Labs blog code
☆35Updated 6 years ago
Alternatives and similar repositories for blog-code
Users that are interested in blog-code are comparing it to the libraries listed below
Sorting:
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 6 years ago
- Telecom scenarios implemented with streaming techniques☆11Updated 2 years ago
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model☆30Updated last year
- AWS Lambda function to get events in Kafka topic when files are uploaded to S3☆24Updated 6 years ago
- Kafka Streams based microservice☆25Updated 8 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 4 years ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Updated last year
- Ansible playbooks for Apache Spark on kube☆27Updated 7 years ago
- Some AWS EMR examples☆16Updated 7 years ago
- ☆45Updated 7 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- ☆7Updated 9 years ago
- ☆54Updated 7 years ago
- A Helm Chart for Apache Airflow☆14Updated 6 years ago
- Kafka streams microservice on Docker☆15Updated 8 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆75Updated 6 years ago
- ☆22Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last week
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆72Updated last year
- Some class materials for a data processing course using PySpark☆52Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images.☆28Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆45Updated 2 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- Node.js kafka connect connector for prometheus☆12Updated 2 years ago
- Reference Architectures for Datalakes on AWS☆79Updated 5 years ago