Spark, Airflow, Kafka
☆24 · Apr 30, 2023 · Updated 3 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below.
- This repository shows my personal notes taken while doing the Udacity Data Engineering Nanodegree ☆13 · May 28, 2020 · Updated 5 years ago
- This is a recipe for a Docker container-based architecture built on Airflow, Kafka, Spark, and Docker ☆19 · Oct 15, 2024 · Updated last year
- A self-contained, ready-to-run Airflow and Kafka project. Can be run locally or within Codespaces. ☆16 · Jul 15, 2023 · Updated 2 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions ☆17 · Aug 14, 2023 · Updated 2 years ago
- Building Real Time Data Pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to track status … ☆24 · Dec 29, 2020 · Updated 5 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆22 · Jul 9, 2019 · Updated 6 years ago
- This is a simple ETL project with Python :) ☆40 · Oct 31, 2022 · Updated 3 years ago
- ☆10 · May 24, 2021 · Updated 4 years ago
- A simple tool for monitoring the progress of OpenFOAM simulations ☆13 · Nov 9, 2018 · Updated 7 years ago
- This project aims to build a streaming application to perform real-time analytics of Covid-19 related tweets and deploy an ML model for r… ☆14 · Jul 15, 2021 · Updated 4 years ago
- Some functions to plot OpenFOAM data with Matplotlib ☆11 · Apr 15, 2021 · Updated 5 years ago
- Chrome Extension for Development/Testing/Exploring GraphQL Servers ☆14 · Oct 1, 2018 · Updated 7 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆91 · Apr 29, 2019 · Updated 7 years ago
- Project for "Data pipeline design patterns" blog. ☆51 · Aug 6, 2024 · Updated last year
- Airflow ETL for Meetup API ☆45 · Dec 27, 2018 · Updated 7 years ago
- Data Science for Good links. ☆14 · Nov 10, 2021 · Updated 4 years ago
- A simple demo showing how to use Ably and FastAPI to route messages into Kafka for stream processing ☆16 · Oct 12, 2021 · Updated 4 years ago
- ☆11 · Apr 9, 2022 · Updated 4 years ago
- Code for my blogs on Data Engineering ☆15 · Nov 9, 2020 · Updated 5 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit ☆20 · Aug 5, 2022 · Updated 3 years ago
- A way for home buyers to know about factors affecting a state ☆48 · Mar 2, 2019 · Updated 7 years ago
- Scripts and code written whilst learning and experimenting with machine learning ☆13 · Jul 18, 2022 · Updated 3 years ago
- Using Redis for data science and data engineering ☆16 · Jan 14, 2020 · Updated 6 years ago
- Just a boilerplate for PySpark and Flask ☆36 · Aug 2, 2018 · Updated 7 years ago
- This is the official GDSC repo with all of the source code presented in the video tutorials ☆13 · Jun 27, 2023 · Updated 2 years ago
- ☆11 · Aug 20, 2024 · Updated last year
- Building Data Warehouse on BigQuery which takes flat file as the data sources with Airflow as the Orchestrator ☆13 · May 23, 2021 · Updated 4 years ago
- Docker powered container for using Nginx as reverse-proxy in combination with an OpenVPN Client. ☆11 · Jan 1, 2020 · Updated 6 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, … ☆57 · Oct 20, 2022 · Updated 3 years ago
- A Python script to convert your YouTube URL to an mp3 file and download it to the same directory as the .py file. ☆10 · May 20, 2025 · Updated last year
- Distributed Data Systems with Azure Databricks, published by Packt ☆12 · Jan 18, 2023 · Updated 3 years ago
- Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker ☆18 · Feb 10, 2019 · Updated 7 years ago
- Multi-threaded simple proxy server in Python with file caching ☆11 · Oct 4, 2020 · Updated 5 years ago
- My MSc project ☆14 · Jun 5, 2011 · Updated 14 years ago
- Port of cgnsToFoam from TurbMachinery SIG to OpenFOAM 5.x and newer ☆16 · Sep 19, 2023 · Updated 2 years ago
- All Data Engineering notebooks from Datacamp course ☆116 · Dec 11, 2019 · Updated 6 years ago
- Natural language processing in TensorFlow ☆15 · Apr 11, 2022 · Updated 4 years ago
- A repo to track data engineering projects ☆13 · Nov 11, 2022 · Updated 3 years ago
- Node-RED Flow (and web page example) for the LLaMA AI model ☆11 · Jul 27, 2023 · Updated 2 years ago