DhiaTN / confluent-avro-py
An Avro SerDe implementation that integrates with the Confluent Schema Registry and serializes and deserializes data according to the Confluent wire format.
☆22 · Updated 4 years ago
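For readers unfamiliar with the Confluent wire format mentioned in the description, it prefixes each Avro-encoded payload with a 5-byte header: a magic byte of `0` followed by the schema ID as a 4-byte big-endian integer. The sketch below shows only that framing, using the standard library. The names `frame` and `unframe` are hypothetical and are not taken from confluent-avro-py, and the Avro payload is treated as opaque bytes rather than actually encoded.

```python
import struct
from typing import Tuple

# Confluent wire format header: 1 magic byte (always 0) + 4-byte big-endian schema ID.
MAGIC_BYTE = 0
_HEADER = struct.Struct(">bI")


def frame(schema_id: int, avro_payload: bytes) -> bytes:
    """Prefix an already Avro-encoded payload with the wire-format header.

    Hypothetical helper for illustration; not part of any library's API.
    """
    return _HEADER.pack(MAGIC_BYTE, schema_id) + avro_payload


def unframe(message: bytes) -> Tuple[int, bytes]:
    """Split a framed message into (schema_id, avro_payload)."""
    if len(message) < _HEADER.size:
        raise ValueError("message is shorter than the 5-byte header")
    magic, schema_id = _HEADER.unpack_from(message)
    if magic != MAGIC_BYTE:
        raise ValueError("unexpected magic byte: %d" % magic)
    return schema_id, message[_HEADER.size:]
```

In a real pipeline the schema ID would come from the schema registry and the payload from an Avro encoder; here both are supplied by the caller so the framing can be seen in isolation.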
Alternatives and similar repositories for confluent-avro-py:
Users who are interested in confluent-avro-py are comparing it to the libraries listed below.
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 · Updated 3 months ago
- Experiments and demonstrations of Avro, Protobuf serialisation ☆60 · Updated 2 years ago
- Minikube for big data with Scala and Spark ☆15 · Updated 5 years ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes ☆21 · Updated 4 years ago
- Quickly get a Kubernetes executor Airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 · Updated 4 years ago
- A PySpark lib to validate data quality ☆18 · Updated 2 years ago
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS) ☆53 · Updated 5 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆147 · Updated 8 years ago
- A curated list of awesome resources for Apache Beam ☆145 · Updated 2 years ago
- Tools for creating Dataproc custom images ☆32 · Updated last week
- Curated by Lenses.io ☆308 · Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆200 · Updated last week
- A simple demonstration of an event driven microservice written in Python ☆79 · Updated 9 years ago
- Ansible role to deploy and configure Airflow ☆41 · Updated 3 weeks ago
- Setting up Apache Airflow on GKE ☆23 · Updated 7 years ago
- ☆10 · Updated 6 years ago
- Contains example DAGs and Terraform code to create a Composer with a node pool to run pods ☆13 · Updated 4 years ago
- BigQuery test kit is a framework written in Python that allows you to be more confident in your SQL and check that they are ready to prod… ☆52 · Updated last year
- ☆20 · Updated 5 years ago
- ☆54 · Updated 7 years ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and … ☆85 · Updated 11 months ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 · Updated last year
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War… ☆25 · Updated 7 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 · Updated 6 years ago
- Just a boilerplate for PySpark and Flask ☆35 · Updated 6 years ago
- Collection of transforms for the Apache Beam Python SDK. ☆89 · Updated last year
- Use Airflow to move data from multiple MySQL databases to BigQuery ☆100 · Updated 4 years ago
- Cloud Spanner Connector for Apache Spark ☆17 · Updated 3 months ago
- Prometheus Exporter for Airflow ☆160 · Updated 10 months ago