Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline
☆32Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for Data-Streaming-Project
Users that are interested in Data-Streaming-Project are comparing it to the libraries listed below.
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- ☆23May 13, 2025Updated last year
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 3 years ago
- Repository dedicated to the Data Lakehouse with Delta Lake workshop☆17Dec 6, 2021Updated 4 years ago
- ☆12Mar 6, 2021Updated 5 years ago
- Data Engineer Project: An end-to-end Airflow data pipeline with BigQuery, dbt, Soda, and more!☆14Dec 14, 2023Updated 2 years ago
- Motion Software Development Kit (SDK)☆15Dec 14, 2022Updated 3 years ago
- ☆66Aug 6, 2024Updated last year
- A well-documented explanation of data structure types including Linked List, Hash table, Binary Tree, Queues, Stack☆13Jul 30, 2022Updated 3 years ago
- Heard of Machine Learning? What's it all about? 🤷🏾♀️ This repo will contain tutorials on different models required to give you an in…☆15Nov 14, 2017Updated 8 years ago
- Static site to create tactical animations☆10Sep 2, 2021Updated 4 years ago
- Demonstration of LLM integration into a lex bot using Lambda codehooks and a Sagemaker endpoint.☆14Dec 20, 2023Updated 2 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆51Dec 2, 2023Updated 2 years ago
- Docker Apache Airflow☆13Mar 1, 2023Updated 3 years ago
- Comprehensive tutorial and toolkit for football data webscraping using Python, Selenium, and various scraping techniques☆63May 1, 2026Updated 2 months ago
- A little steganography. Hiding text or file inside an image using LSB method☆13Nov 15, 2019Updated 6 years ago
- ☆15Mar 14, 2024Updated 2 years ago
- ☆24Dec 4, 2023Updated 2 years ago
- A backtest a day keeps the losses away!☆15Sep 11, 2023Updated 2 years ago
- A simple, customizable, and modern library for displaying alert banners in your Jetpack Compose, Compose Multiplatform and native iOS (Sw…☆58Apr 10, 2026Updated 2 months ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 5 years ago
- ☆46Dec 2, 2025Updated 7 months ago
- ☆24Jul 21, 2022Updated 3 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- A collection of practice projects on big data topics, including Hadoop, Spark, Spark Streaming, and Kafka☆11Mar 7, 2019Updated 7 years ago
- This repo is for the LinkedIn Learning course: End-to-End Data Engineering Project☆35Nov 9, 2023Updated 2 years ago
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 5 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- ☆29Apr 24, 2026Updated 2 months ago
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆21Aug 12, 2025Updated 10 months ago
- A lightweight, open-source UI for dbt that provides model browsing, lineage visualization, run orchestration, documentation previews, and…☆63May 30, 2026Updated last month
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- 🚀 A simple JavaScript template for rapid development of GitHub Actions.☆17Feb 24, 2023Updated 3 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆69Mar 9, 2024Updated 2 years ago
- Flask based Movie Recommendation System☆12May 1, 2023Updated 3 years ago
- I will share DSA notes and code here☆19Mar 24, 2023Updated 3 years ago