airscholar/Japan-visa-data-engineering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/airscholar/Japan-visa-data-engineering)

airscholar / Japan-visa-data-engineering

This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark clusters are set up within a Docker container on Azure.

☆11

Alternatives and similar repositories for Japan-visa-data-engineering

Users that are interested in Japan-visa-data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

airscholar / YoutubeAnalytics
View on GitHub
An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…
☆16Sep 19, 2023Updated 2 years ago
airscholar / FootballDataEngineering
View on GitHub
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az…
☆30Oct 2, 2023Updated 2 years ago
airscholar / cicd_for_data_engineering
View on GitHub
This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …
☆14Dec 27, 2023Updated 2 years ago
airscholar / ApacheFlink-SalesAnalytics
View on GitHub
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…
☆12Nov 18, 2023Updated 2 years ago
airscholar / Kubernetes-For-DataEngineering
View on GitHub
This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…
☆25Jan 26, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
airscholar / RealtimeStreamingEngineering
View on GitHub
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…
☆44Jan 4, 2024Updated 2 years ago
airscholar / changecapture-e2e
View on GitHub
This project shows how to capture changes from postgres database and stream them into kafka
☆41May 17, 2024Updated 2 years ago
airscholar / realtime-voting-data-engineering
View on GitHub
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…
☆48Dec 11, 2023Updated 2 years ago
airscholar / modern-data-eng-dbt-databricks-azure
View on GitHub
In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …
☆38Dec 18, 2023Updated 2 years ago
sdw-online / python_sql_football_data_pipeline
View on GitHub
A data pipeline for processing football data using Python and SQL
☆13Sep 12, 2023Updated 2 years ago
confluentinc / demo-database-modernization
View on GitHub
This demo shows how to stream data to cloud databases with Confluent. It includes fully-managed connectors (Oracle CDC, RabbitMQ, MongoDB…
☆11Jan 10, 2025Updated last year
airscholar / e2e-data-engineering
View on GitHub
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…
☆336Feb 14, 2025Updated last year
airscholar / FlinkCommerce
View on GitHub
This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…
☆51Dec 4, 2023Updated 2 years ago
Al-Moatasem / ETL-Telecom-SSIS
View on GitHub
☆16Dec 30, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
airscholar / RedditDataEngineering
View on GitHub
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…
☆225Oct 23, 2023Updated 2 years ago
airscholar / SparkingFlow
View on GitHub
This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…
☆48Mar 14, 2024Updated 2 years ago
Rohanvp07 / Covid-19-Analysis-and-Prediction
View on GitHub
☆16Feb 20, 2026Updated 5 months ago
wirelessr / trino-iceberg-playground
View on GitHub
Query Iceberg in Trino, Nessie as Catalog, and use minio to replace AWS S3
☆27Aug 7, 2025Updated 11 months ago
Mouhamed-Jinja / Hadoop-Docker-Spark-Sql-Hive-Data-Integration-and-Warehousing-Project
View on GitHub
This project leverages Hadoop, Spark, SQL, and Hive for efficient data integration, transformation, warehousing, and analytics. It provid…
☆24Sep 30, 2023Updated 2 years ago
jukkakansanaho / udacity-dend-project-3
View on GitHub
Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)
☆22Jun 20, 2019Updated 7 years ago
maxmekiska / micro-templates
View on GitHub
Repository to host micro service implementation patterns.
☆14Jun 25, 2025Updated last year
unconv / calorieapp
View on GitHub
GPT-4o Powered Calorie Detecor
☆18May 29, 2024Updated 2 years ago
mguay22 / workers-pm2
View on GitHub
☆10Jan 8, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Bradleykingz / automated-postgres-backups-with-node
View on GitHub
Automatically backing up your Postgres database using NodeJS
☆13Nov 14, 2020Updated 5 years ago
tushar2704 / ML-Portfolio
View on GitHub
This repository showcases a collection of machine learning projects in various domains, demonstrating my skills and expertise as a data s…
☆12Nov 20, 2023Updated 2 years ago
Azure-Samples / azure-sql-db-ai-samples-search
View on GitHub
To gain access, please finish setting up this repository now at:
☆35Mar 23, 2026Updated 4 months ago
justashton / justashton
View on GitHub
☆30Aug 14, 2025Updated 11 months ago
thuytv-gl / fabric-CJK-vertical
View on GitHub
☆10Jan 18, 2024Updated 2 years ago
muhammetbektas / spark_clickhouse_streaming
View on GitHub
Realtime Data Engineering Project
☆31Jan 12, 2025Updated last year
Muhammadatef / ArsenalFC-Data-Pipeline-Project
View on GitHub
☆29Oct 24, 2024Updated last year
luqasn / aws-sandbox
View on GitHub
Transparent sandbox for integration testing against AWS services. Test your infrastructure without changes to your Terraform files or you…
☆12Oct 26, 2023Updated 2 years ago
malvik01 / Earthquake-Data-Engineering-Project-with-Microsoft-Fabric
View on GitHub
☆16Apr 18, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
codedecode25 / Microservices_vaccination_citizen
View on GitHub
☆13May 18, 2026Updated 2 months ago
mwanago / nestjs-dockerized
View on GitHub
☆17Mar 10, 2023Updated 3 years ago
ShivaBhattacharjee / https-server-golang
View on GitHub
☆12Jan 31, 2026Updated 5 months ago
bezkoder / angular-14-refresh-token
View on GitHub
Angular JWT refresh token with Interceptor, handle token expiration in Angular 14 - Refresh token before expiration example
☆13Sep 20, 2022Updated 3 years ago
10h30 / kazewp
View on GitHub
KazeWP is a simple and flexible tool for managing multiple WordPress sites behind a Caddy reverse proxy server. Built with Docker and Bas…
☆17Apr 28, 2025Updated last year
ongxuanhong / de02-pyspark-optimization
View on GitHub
☆14Mar 11, 2023Updated 3 years ago
RWaltersMA / mongo-source-sink
View on GitHub
This is an example of using MongoDB as both a source and sink.
☆10May 21, 2020Updated 6 years ago