Produce Kafka messages, consume them, and upload them into Cassandra and MongoDB.
☆43 · Sep 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for airflow_kafka_cassandra_mongodb
Users that are interested in airflow_kafka_cassandra_mongodb are comparing it to the libraries listed below
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow. ☆37 · Sep 1, 2023 · Updated 2 years ago
- Glue ETL job or EMR Spark job that reads from the Data Catalog, modifies the data, and uploads it to S3 and the Data Catalog. ☆13 · Aug 26, 2023 · Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra. ☆145 · Jul 27, 2023 · Updated 2 years ago
- ☆46 · Jul 6, 2024 · Updated last year
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple … ☆30 · Aug 26, 2020 · Updated 5 years ago
- This project demonstrates real-time data streaming and processing architecture using Kafka, Spark Streaming, and Debezium for capturing C… ☆13 · Oct 24, 2024 · Updated last year
- Docker Apache Airflow ☆13 · Mar 1, 2023 · Updated 3 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3) ☆16 · Sep 20, 2023 · Updated 2 years ago
- Code relating to the Coursera Bioinformatics Specialization as well as my own genetic algorithm experiment. ☆11 · Apr 19, 2019 · Updated 6 years ago
- Create streaming data, transfer it to Kafka, modify it with PySpark, and send it to ElasticSearch and MinIO. ☆65 · Jul 21, 2023 · Updated 2 years ago
- ☆18 · Nov 9, 2025 · Updated 4 months ago
- 👾 This repository contains files related to my personal website. Charts, Jupyter notebooks, random notes, etc. ☆18 · Oct 31, 2024 · Updated last year
- This repository shows how to deploy and serve a machine learning model with FastAPI, using MLFlow-MINIO. ☆18 · Jun 11, 2023 · Updated 2 years ago
- Content for a talk on "The wonderful world of data quality tools in Python" ☆18 · May 5, 2021 · Updated 4 years ago
- Analytics engineering with dbt - projects and developer environment ☆22 · Sep 27, 2024 · Updated last year
- ☆23 · Jun 12, 2023 · Updated 2 years ago
- Udacity Data Streaming Nanodegree Program