cloudboxlabs / blog-code
Cloudbox Labs blog code
☆35Updated 6 years ago
Alternatives and similar repositories for blog-code:
Users that are interested in blog-code are comparing it to the libraries listed below
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 6 years ago
- Some AWS EMR examples☆16Updated 7 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model☆30Updated last year
- AWS Lambda function to get events in Kafka topic when files are uploaded to S3☆24Updated 6 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 7 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆98Updated 5 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- ☆7Updated 9 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 6 years ago
- Cloudformation templates for deploying Airflow in ECS☆40Updated 6 years ago
- Mirror of Apache Beam☆10Updated 4 years ago
- Minikube for big data with Scala and Spark☆15Updated 5 years ago
- This package can pull random public tweets from Twitter (tweets-streaming.py) or generates simulated tweets (tweets-simulated.py). The re…☆24Updated 5 years ago
- Some class materials for a data processing course using PySpark☆52Updated 2 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.☆28Updated 7 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Example blueprint application for processing high-speed trading data.☆84Updated last year
- ☆54Updated 7 years ago
- Kafka Connect connector to stream data in real time from Twitter.☆126Updated 2 years ago
- Materials (slides and code) for Kafka and Kafka Streams Workshops☆62Updated 10 months ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆71Updated last year
- Bash completion for Kafka command line utilities.☆34Updated 7 years ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 8 years ago
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Updated 5 years ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra☆84Updated 8 years ago
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository.☆70Updated 4 years ago
- ☆16Updated 8 years ago
- NiFi Bundle for FIX Protocol☆16Updated 7 years ago
- ☆45Updated 7 years ago