daddydrac / PySpark-Confluent-Kafka-Apache-Drill-
View external linksLinks

A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows for for fast feature engineering and data cleaning.
28Jul 8, 2019Updated 6 years ago

Alternatives and similar repositories for PySpark-Confluent-Kafka-Apache-Drill-

Users that are interested in PySpark-Confluent-Kafka-Apache-Drill- are comparing it to the libraries listed below

Sorting:

Are these results useful?