akarce / e2e-structured-streaming

End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API, sends the data to Kafka, and processes it with Spark before writing the results to Cassandra. The pipeline is written in Python, relies on Apache ZooKeeper, and is containerized with Docker for easy deployment and scalability.
20 · Updated 11 months ago
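
The stage ordering in the description (fetch from an API, publish to Kafka, process with Spark, write to Cassandra) can be illustrated with a toy, in-memory sketch. This is not the repository's code: it uses only the Python standard library, a `queue.Queue` stands in for a Kafka topic, a plain list stands in for a Cassandra table, and the function names, field names, and sample records are all hypothetical.

```python
import json
import queue

# Hypothetical stand-ins (assumptions, not the repository's implementation):
# a queue.Queue plays the role of a Kafka topic, and a list plays the role
# of a Cassandra table.


def fetch_records():
    """Stage 1 (scheduled by Airflow in the real pipeline): fetch from an API.

    Returns made-up sample data instead of calling a real endpoint.
    """
    return [
        {"id": 1, "name": {"first": "Ada", "last": "Lovelace"}},
        {"id": 2, "name": {"first": "Alan", "last": "Turing"}},
    ]


def produce(topic, records):
    """Stage 2: serialize each record as JSON bytes and publish it."""
    for record in records:
        topic.put(json.dumps(record).encode("utf-8"))


def transform(message):
    """Stage 3 (Spark in the real pipeline): parse and flatten one message."""
    record = json.loads(message.decode("utf-8"))
    return {
        "id": record["id"],
        "full_name": f'{record["name"]["first"]} {record["name"]["last"]}',
    }


def run_pipeline():
    """Run the stages in order and return the rows written to the 'table'."""
    topic = queue.Queue()  # stand-in for a Kafka topic
    table = []             # stand-in for a Cassandra table
    produce(topic, fetch_records())
    # Stage 4: drain the topic, transform each message, and store the row.
    while not topic.empty():
        table.append(transform(topic.get()))
    return table


if __name__ == "__main__":
    for row in run_pipeline():
        print(row)
```

In a real deployment each stand-in would be replaced by the corresponding service client, and the stages would run as separate containers rather than in one process.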

Alternatives and similar repositories for e2e-structured-streaming

Users who are interested in e2e-structured-streaming are comparing it to the repositories listed below.
