kaoutaar / end-to-end-etl-pipeline-jcdecaux-API
velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docker Compose.
☆20 · Aug 12, 2025 · Updated 6 months ago
Alternatives and similar repositories for end-to-end-etl-pipeline-jcdecaux-API
Users that are interested in end-to-end-etl-pipeline-jcdecaux-API are comparing it to the libraries listed below
- Data Pipeline from the Global Historical Climatology Network DataSet ☆27 · Dec 20, 2022 · Updated 3 years ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc… ☆22 · Nov 19, 2024 · Updated last year
- Building a Comprehensive Repository of Hromada-Level Data in Ukraine to Facilitate Research and Informed Policy Decisions. This repositor… ☆14 · Dec 23, 2025 · Updated last month
- ☆30 · Feb 11, 2024 · Updated 2 years ago
- Sample project to demonstrate data engineering best practices ☆203 · Feb 24, 2024 · Updated last year
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits ☆143 · Aug 23, 2023 · Updated 2 years ago
- ☆11 · Dec 28, 2020 · Updated 5 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data ☆40 · Feb 3, 2024 · Updated 2 years ago
- ☆10 · Feb 27, 2024 · Updated last year
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI ☆10 · Apr 13, 2023 · Updated 2 years ago
- Houston orchestration API. callhouston.io ☆51 · Jun 16, 2025 · Updated 8 months ago
- ☆12 · Nov 18, 2022 · Updated 3 years ago
- ☆13 · May 1, 2024 · Updated last year
- Creating a REST API with Python on Synapse Serverless pools using external tables ☆12 · Dec 27, 2021 · Updated 4 years ago
- ☆11 · Aug 20, 2024 · Updated last year
- Use YOLOv8 with a DJI Tello Drone! Controlled with the Tello app and provides video results post-flight. ☆17 · Dec 8, 2023 · Updated 2 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard ☆17 · Mar 31, 2024 · Updated last year
- ZINDI GIZ NLP Agricultural Keyword Spotter 3rd place solution, Audio Classification ☆11 · Sep 8, 2021 · Updated 4 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. … ☆24 · Aug 10, 2025 · Updated 6 months ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools. ☆13 · Nov 1, 2023 · Updated 2 years ago
- In this project I have built an ETL pipeline which scrapes the trending repositories based on month, week and day LIVE extract other related info… ☆12 · Sep 9, 2023 · Updated 2 years ago
- Function to rotate storage account keys stored in key vault as secret ☆13 · Nov 15, 2023 · Updated 2 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆49 · Dec 2, 2023 · Updated 2 years ago
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time. ☆13 · Feb 11, 2026 · Updated last week
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view ☆17 · Jan 16, 2026 · Updated last month
- Microsoft 365 Defender Hunting via PowerShell. ☆14 · Feb 8, 2022 · Updated 4 years ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP" ☆12 · May 9, 2023 · Updated 2 years ago
- ☆14 · Sep 19, 2025 · Updated 4 months ago
- Data pipeline that scrapes Rust cheater Steam profiles ☆54 · Feb 13, 2022 · Updated 4 years ago
- Code Repository for my 3rd Data Project. ☆16 · Jun 13, 2023 · Updated 2 years ago
- Automate Budget Planning with Linear Programming ☆14 · Jan 3, 2026 · Updated last month
- streaming eight subreddits from reddit api using kafka producer & spark structured streaming. ☆19 · Updated this week
- End to end data engineering project ☆58 · Oct 27, 2022 · Updated 3 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in… ☆19 · Aug 16, 2019 · Updated 6 years ago
- This repository contains notebooks with different probability density function estimators. ☆13 · Jun 4, 2020 · Updated 5 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform ☆14 · Oct 12, 2022 · Updated 3 years ago
- ☆21 · Apr 21, 2025 · Updated 9 months ago
- StarCraft 2 Data Pipeline with Airflow, DuckDB and Streamlit ☆17 · Mar 14, 2024 · Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆75 · Sep 2, 2023 · Updated 2 years ago