longNguyen010203 / Youtube-Recommend-Master-ETL-Pipeline
A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API.
☆22 · Nov 19, 2024 · Updated last year
Alternatives and similar repositories for Youtube-Recommend-Master-ETL-Pipeline
Users who are interested in Youtube-Recommend-Master-ETL-Pipeline are comparing it to the repositories listed below.
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar… ☆42 · Apr 22, 2023 · Updated 2 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c… ☆15 · Oct 2, 2023 · Updated 2 years ago
- A quickstart tool for creating a FastAPI project with Jinja2, TailwindCSS, Flowbite, HTMX, and AlpineJS. ☆13 · Jun 23, 2025 · Updated 7 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆20 · Aug 12, 2025 · Updated 6 months ago
- End-to-end ELT data engineering project ☆22 · Dec 24, 2022 · Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project ☆24 · Jul 14, 2022 · Updated 3 years ago
- An end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apa… ☆27 · Jun 7, 2023 · Updated 2 years ago
- My Setup Development Environment as Data Engineer ☆35 · Aug 5, 2025 · Updated 6 months ago
- Rethinking the User Interface of AI ☆30 · Updated this week
- Simple ETL pipeline using Python ☆29 · May 22, 2023 · Updated 2 years ago
- Telegram bot that manages creation of queues / attendance lists for periodic events. ☆15 · Dec 23, 2024 · Updated last year
- Playable synthesizer created with Tone.js, Next.js, and React. ☆10 · Aug 14, 2022 · Updated 3 years ago
- ☆16 · Oct 8, 2025 · Updated 4 months ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM. ☆11 · Mar 19, 2024 · Updated last year
- ☆40 · Mar 9, 2025 · Updated 11 months ago
- OpenExchange web application ☆16 · Apr 5, 2025 · Updated 10 months ago
- Zero-Config Terraform Module to deploy Next.js Apps on AWS using Serverless resources ☆12 · Dec 4, 2025 · Updated 2 months ago
- Simple terminal UI for managing /etc/hosts files! ☆17 · Aug 18, 2024 · Updated last year
- CSV-to-Parquet (and vice versa) file converter based on pandas, written in Python 3 ☆10 · Mar 23, 2021 · Updated 4 years ago
- ☆10 · Feb 27, 2024 · Updated last year
- A simple JavaScript library for reading glucose data from the Dexcom Share API. ☆10 · Aug 11, 2023 · Updated 2 years ago
- GraphQL API for cloud pricing. Contains over 3M public prices from AWS, Azure and GCP. Self-updates prices via an automated weekly job. ☆18 · Updated this week
- Program summarizes news articles into a couple of sentences. This project was inspired by SMMRY, the algorithm used in many subreddits to… ☆10 · Jan 15, 2019 · Updated 7 years ago
- My solutions for the Udacity Data Engineering Nanodegree ☆34 · Oct 14, 2019 · Updated 6 years ago
- This web extension allows users to navigate Glassdoor while logged out. ☆13 · Jan 15, 2025 · Updated last year
- Cool DE Projects ☆60 · Dec 23, 2025 · Updated last month
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆102 · Nov 3, 2024 · Updated last year
- Fritz!Box tool set, e.g. parsing the call monitor, adding phone entries to phonebook, auto-blocking calls etc. ☆11 · Feb 19, 2023 · Updated 2 years ago
- Modern RDAP lookup tool with 14+ cybersecurity analysis tools. Domain intelligence, SSL/TLS analysis, threat detection, email security … ☆16 · Jan 31, 2026 · Updated 2 weeks ago
- 🚀 Complete AWS learning path for beginners - 45K+ community resource with hands-on labs, workshops, and certification guides ☆18 · Oct 17, 2024 · Updated last year
- Acquiring and processing information on the world's largest banks ☆17 · Jun 17, 2025 · Updated 8 months ago
- Project on belief embedding ☆20 · Jun 4, 2025 · Updated 8 months ago
- terrajux diffs the source code of a terraform project and all of its transitive module dependencies between two git refs. ☆11 · Jun 14, 2021 · Updated 4 years ago
- MadflixGPT is an innovative movie information platform powered by AI, offering personalized movie recommendations based on any query. Exp… ☆14 · Apr 8, 2024 · Updated last year
- Quickly deploy k3s onto 4 'free forever' ARM VMs on Oracle Cloud ☆10 · Dec 20, 2024 · Updated last year
- Faster News repository: A swift, modern website using Svelte (front-end) and Go (back-end) for optimal performance ☆16 · Jan 27, 2024 · Updated 2 years ago
- Python Pydantic model of the Prometheus Alertmanager alert payload ☆10 · Jul 24, 2023 · Updated 2 years ago
- Udacity AWS DevOps Capstone project ☆11 · Apr 9, 2021 · Updated 4 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste… ☆12 · Oct 11, 2023 · Updated 2 years ago