hellokoding / kafka-connect-sink-postgres-with-avro-schemaregistry-python
Streaming Data from Kafka to Postgres with Kafka Connect, AVRO, Schema Registry and Python
☆14 Updated 7 years ago
Alternatives and similar repositories for kafka-connect-sink-postgres-with-avro-schemaregistry-python
Users who are interested in kafka-connect-sink-postgres-with-avro-schemaregistry-python are comparing it to the libraries listed below
- Kafka Connect connector to stream data in real time from Twitter. ☆127 Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆175 Updated 6 months ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 3 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 Updated 5 years ago
- Real-world Spark pipelines examples ☆83 Updated 7 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js ☆50 Updated 2 years ago
- How to build an awesome data engineering team ☆101 Updated 6 years ago
- A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets(Lambda Arc… ☆187 Updated 3 months ago
- Fully reproducible, Dockerized, step-by-step, demo on how to stream tables from Postgres to Kafka/KSQL back to Postgres. Detailed blog p… ☆152 Updated 4 years ago
- Maven quick start for building Kafka Connect connectors. ☆147 Updated 4 years ago
- Docker container for Kafka - Spark Streaming - Cassandra ☆97 Updated 6 years ago
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL ☆97 Updated 6 years ago
- Repository used for Spark Trainings ☆54 Updated 2 years ago
- Various Demos mostly based on docker environments ☆33 Updated 3 years ago
- Kafka Connect connector for CDC data from postgres ☆11 Updated 8 years ago
- Examples To Help You Learn Apache Spark ☆78 Updated 7 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 Updated 11 months ago
- Educational notes, Hands on problems w/ solutions for hadoop ecosystem ☆87 Updated 6 years ago
- This project describes how to write full ETL data pipeline using spark. ☆15 Updated 3 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 6 years ago
- ☆63 Updated last year
- Apache Spark docker container image (Standalone mode) ☆35 Updated 5 years ago
- Example Maven configuration for a Spark, Scala project ☆54 Updated 3 months ago
- Simple way to copy data from relational databases into kafka. ☆20 Updated 8 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆89 Updated 6 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆55 Updated 2 years ago
- Airflow training for the crunch conf ☆104 Updated 7 years ago
- ☆75 Updated 5 years ago
- Spark Examples ☆126 Updated 3 years ago
- Apache Spark Course Material ☆96 Updated 2 years ago