cloudacademy / beam
Mirror of Apache Beam
☆10Updated 4 years ago
Alternatives and similar repositories for beam:
Users that are interested in beam are comparing it to the libraries listed below
- ☆12Updated last year
- ☆20Updated 5 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆91Updated 8 months ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery☆45Updated last year
- ☆31Updated 6 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Create Kafka-Connect clusters with docker . You put the Kafka, we put the Connect.☆25Updated 6 years ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated 2 months ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated 2 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 7 years ago
- ☆66Updated 8 months ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 6 years ago
- Relational Database Import to Big Query with Dataflow and DLP API☆18Updated 5 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Updated 2 years ago
- ☆54Updated 7 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆21Updated 4 years ago
- Repository for Google Cloud Run Deep Dive☆11Updated 4 years ago
- Samples to help you get started with the AWS Data Exchange API.☆22Updated 5 months ago
- Content and Instructions for completing the "Making Things Right with AWS Lambda and AWS Config Rules" Workshop.☆22Updated 7 years ago
- ☆35Updated 5 years ago
- A Helm Chart for Apache Airflow☆14Updated 6 years ago
- Starting point for the GCP and K8S Continuous Delivery Seed☆15Updated 7 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Updated 2 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated last year
- Kubernetes demos☆16Updated 7 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- minio as local storage and DynamoDB as catalog☆13Updated 11 months ago