This project provides end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The Spark clusters are set up within a Docker container on Azure.
☆12 · Oct 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for Japan-visa-data-engineering
Users that are interested in Japan-visa-data-engineering are comparing it to the libraries listed below.
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD… ☆16 · Sep 19, 2023 · Updated 2 years ago
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az… ☆32 · Oct 2, 2023 · Updated 2 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with … ☆14 · Dec 27, 2023 · Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆25 · Jan 26, 2024 · Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de… ☆11 · Nov 18, 2023 · Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA… ☆43 · Jan 4, 2024 · Updated 2 years ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆42 · May 17, 2024 · Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆45 · Dec 11, 2023 · Updated 2 years ago
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our … ☆39 · Dec 18, 2023 · Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh… ☆209 · Oct 23, 2023 · Updated 2 years ago
- A data pipeline for processing football data using Python and SQL ☆13 · Sep 12, 2023 · Updated 2 years ago
- This project leverages Hadoop, Spark, SQL, and Hive for efficient data integration, transformation, warehousing, and analytics. It provid… ☆22 · Sep 30, 2023 · Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆320 · Feb 14, 2025 · Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆49 · Dec 4, 2023 · Updated 2 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to an Apache Spark cluster in different programming languages using Python… ☆48 · Mar 14, 2024 · Updated 2 years ago
- Implementation of a real-time GPS tracking service with Python and Apache Kafka. ☆23 · May 17, 2020 · Updated 5 years ago
- ☆14 · Apr 18, 2024 · Updated last year
- This is an end-to-end MLOps system ☆34 · Nov 27, 2025 · Updated 3 months ago
- Repository to host microservice implementation patterns. ☆13 · Jun 25, 2025 · Updated 9 months ago
- Query Iceberg in Trino, with Nessie as the catalog and MinIO replacing AWS S3 ☆26 · Aug 7, 2025 · Updated 7 months ago
- Sample Spring Boot project implementing a REST CRUD application ☆17 · Oct 19, 2021 · Updated 4 years ago
- GPT-4o Powered Calorie Detector ☆18 · May 29, 2024 · Updated last year
- ☆19 · Jun 8, 2025 · Updated 9 months ago
- ☆10 · Jan 8, 2024 · Updated 2 years ago
- Automatically backing up your Postgres database using NodeJS ☆13 · Nov 14, 2020 · Updated 5 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse) ☆22 · Jun 20, 2019 · Updated 6 years ago
- ☆10 · Jan 18, 2024 · Updated 2 years ago
- Transparent sandbox for integration testing against AWS services. Test your infrastructure without changes to your Terraform files or you… ☆12 · Oct 26, 2023 · Updated 2 years ago
- ☆29 · Oct 24, 2024 · Updated last year
- ☆12 · Jan 31, 2026 · Updated last month
- ☆29 · Aug 14, 2025 · Updated 7 months ago
- Realtime Data Engineering Project ☆30 · Jan 12, 2025 · Updated last year
- KazeWP is a simple and flexible tool for managing multiple WordPress sites behind a Caddy reverse proxy server. Built with Docker and Bas… ☆17 · Apr 28, 2025 · Updated 10 months ago
- ☆14 · Mar 11, 2023 · Updated 3 years ago
- A formatter for handling parameters and files uploaded to a Web API controller. ☆26 · Dec 7, 2022 · Updated 3 years ago
- This is an example of using MongoDB as both a source and sink. ☆10 · May 21, 2020 · Updated 5 years ago
- THREE JS library of anchorable 2D buttons and labels ☆13 · Nov 29, 2016 · Updated 9 years ago
- ☆13 · Jan 6, 2022 · Updated 4 years ago
- ☆16 · Mar 9, 2026 · Updated 2 weeks ago