dogukannulu / airflow_kafka_cassandra_mongodbView external linksLinks
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
☆43Sep 26, 2023Updated 2 years ago
Alternatives and similar repositories for airflow_kafka_cassandra_mongodb
Users that are interested in airflow_kafka_cassandra_mongodb are comparing it to the libraries listed below
Sorting:
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆38Sep 1, 2023Updated 2 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆144Jul 27, 2023Updated 2 years ago
- ☆46Jul 6, 2024Updated last year
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Aug 26, 2020Updated 5 years ago
- This project demonstrates real-time data streaming and processing architecture using Kafka, Spark Streaming, and Debezium for capturing C…☆13Oct 24, 2024Updated last year
- Docker Apache Airflow☆13Mar 1, 2023Updated 2 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆16Sep 20, 2023Updated 2 years ago
- Code relating to the Coursera Bioinformatics Specialization as well as my own genetic algorithm experiment.☆11Apr 19, 2019Updated 6 years ago
- A self-contained, ready to run Airflow and Kafka project. Can be run locally or within codespaces.☆16Jul 15, 2023Updated 2 years ago
- ☆18Nov 9, 2025Updated 3 months ago
- This repository about how to deploy machine learning model end serving with FastAPI and using MLFlow-MINIO☆18Jun 11, 2023Updated 2 years ago
- 👾 This repository contains files related to my personal website. Charts, Jupyter notebooks, random notes, etc.☆18Oct 31, 2024Updated last year
- Content for a talk on "The wonderful world of data quality tools in Python"☆18May 5, 2021Updated 4 years ago
- ☆23Jun 12, 2023Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- A pipeline to detect data drift and retrain the model when there is drift☆24Aug 3, 2023Updated 2 years ago
- Udacity Data Streaming Nanodegree Program☆24Feb 20, 2021Updated 4 years ago
- IBM Data Engineering Professional Certificate☆32May 10, 2025Updated 9 months ago
- Realtime Data Engineering Project☆30Jan 12, 2025Updated last year
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated 2 weeks ago
- ☆10Sep 20, 2020Updated 5 years ago
- ☆10Jul 24, 2022Updated 3 years ago
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated 2 weeks ago
- ฝึกนักสร้างเว็บไซต์ จาก ผู้เริ่มต้น ไปเป็น มือโปร☆15Nov 26, 2023Updated 2 years ago
- Framework for studying cryptographic hash functions using SAT.☆10Dec 21, 2021Updated 4 years ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆41May 17, 2024Updated last year
- A tutorial for using YOLACT in Google Colab☆36Jan 9, 2020Updated 6 years ago
- Code I use in my medium blog/articles.☆39Jun 9, 2025Updated 8 months ago
- Collated optimization models from numerous sources☆13Jun 18, 2023Updated 2 years ago
- Movie Reviews Sentiment Analysis☆12Jun 28, 2018Updated 7 years ago
- ☆10Apr 18, 2024Updated last year
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆13May 6, 2025Updated 9 months ago
- FastPy-RS is a high-performance Python library that provides optimized implementations of common functions using Rust.☆18Aug 19, 2025Updated 5 months ago
- ☆11Feb 20, 2016Updated 9 years ago
- ☆15Apr 1, 2025Updated 10 months ago
- ☆10Jun 22, 2022Updated 3 years ago
- Generative Adversarial Networks☆10Feb 2, 2023Updated 3 years ago