☆46 · Jul 6, 2024 · Updated last year
Alternatives and similar repositories for Data-Engineering-Streaming-Project
Users that are interested in Data-Engineering-Streaming-Project are comparing it to the libraries listed below.
- ☆10 · May 5, 2022 · Updated 4 years ago
- Courses and projects on Data Camp ☆11 · Jun 28, 2020 · Updated 5 years ago
- Companion repository that goes along with Snowflake's "Advanced Data Engineering with Snowflake" course ☆35 · Apr 23, 2025 · Updated last year
- ☆15 · Sep 9, 2023 · Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra ☆146 · Jul 27, 2023 · Updated 2 years ago
- Quickstart to Cilium ☆17 · Oct 1, 2025 · Updated 8 months ago
- ☆30 · Nov 16, 2023 · Updated 2 years ago
- Code for the Data Engineering Zoomcamp ☆20 · Dec 12, 2022 · Updated 3 years ago
- Produce Kafka messages, consume them, and upload them into Cassandra and MongoDB. ☆43 · Sep 26, 2023 · Updated 2 years ago
- ☆13 · May 11, 2025 · Updated last year
- Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline ☆32 · Oct 25, 2023 · Updated 2 years ago
- Contains Spark DataFrame solutions to LeetCode questions ☆24 · Dec 13, 2022 · Updated 3 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 · Dec 28, 2022 · Updated 3 years ago
- ☆21 · Jan 13, 2024 · Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆25 · Jan 26, 2024 · Updated 2 years ago
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres, and Docker. ☆115 · Jan 8, 2026 · Updated 5 months ago
- Example Flink and Kafka integration project ☆15 · Nov 28, 2015 · Updated 10 years ago
- Covid19 and Iowa Liquor Sales analysis in BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 · Jun 18, 2022 · Updated 3 years ago
- Content for a talk on "The wonderful world of data quality tools in Python" ☆18 · May 5, 2021 · Updated 5 years ago
- ☆13 · Oct 28, 2025 · Updated 7 months ago
- Code snippets and techniques that I frequently reuse ☆16 · Jan 3, 2024 · Updated 2 years ago
- Docker with Airflow and Spark standalone cluster ☆264 · Aug 5, 2023 · Updated 2 years ago
- Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical va… ☆20 · May 31, 2026 · Updated last week
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator ☆15 · Jun 4, 2024 · Updated 2 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more! ☆26 · Nov 8, 2022 · Updated 3 years ago
- Dockerfile for OpenLogReplicator ☆21 · Mar 3, 2026 · Updated 3 months ago
- ☆12 · Mar 6, 2021 · Updated 5 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- Example of how to build a machine learning training workflow on AWS with Prefect ☆12 · Nov 2, 2022 · Updated 3 years ago
- ☆16 · May 29, 2023 · Updated 3 years ago
- This project is focused on the Deployment phase of machine learning. Docker and FastAPI are used to deploy a dockerized server of tra… ☆27 · Jan 7, 2023 · Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, DuckDB and Superset ☆49 · Apr 5, 2026 · Updated 2 months ago
- Deployment of Airflow 2.0 using ECS Fargate and AWS CDK. ☆14 · Nov 5, 2021 · Updated 4 years ago
- This is the first project where we worked with Apache Spark. In this project, we downloaded the datasets from KAGG… ☆23 · Oct 14, 2021 · Updated 4 years ago
- AI-enhanced automation tool for financial modelling and market analysis. ☆12 · Sep 10, 2019 · Updated 6 years ago
- Code for Tajima et al. (2019). Optimal policy for multi-alternative decisions. Nature Neuroscience. ☆10 · Aug 23, 2019 · Updated 6 years ago
- Implementation of Few-shot Binary Image Classification using Contrastive Learning-based Approach in PyTorch ☆11 · May 1, 2023 · Updated 3 years ago
- A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems ☆55 · Sep 30, 2023 · Updated 2 years ago
- Intelligent Water Drops Algorithm for TSP. ☆10 · Dec 12, 2020 · Updated 5 years ago