rohitrsp898/Basic_ETL_PySpark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rohitrsp898/Basic_ETL_PySpark)

rohitrsp898 / Basic_ETL_PySpark

☆21

Alternatives and similar repositories for Basic_ETL_PySpark

Users that are interested in Basic_ETL_PySpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

skoonData / docker-compose
View on GitHub
☆12Jul 27, 2021Updated 4 years ago
datacoves / snowcap
View on GitHub
Snowcap - Snowflake infrastructure-as-code. Provision Snowflake resources, Manage RBAC, users, roles, and grants.
☆15Jul 10, 2026Updated 2 weeks ago
dynonguyen / Data-Warehouse-UKAccident
View on GitHub
Information system for business project - building and mining data warehouse
☆10Jan 11, 2022Updated 4 years ago
ahmedsami76 / spark
View on GitHub
Repo for Spark tutorial
☆14Jan 1, 2025Updated last year
NikhilDhiman / SQOOP-Automation
View on GitHub
A shell script to automate the operations of sqoop
☆11Mar 29, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
datamarts / prostore
View on GitHub
Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.
☆17Apr 22, 2022Updated 4 years ago
d-one / d-one-mlops-aws
View on GitHub
Repository for the D ONE MLOps AWS BlogPost
☆10May 5, 2026Updated 2 months ago
chiyahn / rMSWITCH
View on GitHub
R package for Markov regime-switching models
☆12Jan 23, 2018Updated 8 years ago
ketgo / marshmallow-pyspark
View on GitHub
Marshmallow serializer integration with pyspark
☆12Dec 29, 2023Updated 2 years ago
sahilbhange / spark-slowly-changing-dimension
View on GitHub
Spark implementation of Slowly Changing Dimension type 2
☆11Jan 8, 2019Updated 7 years ago
harshithvarmapothuri / ML-Algorithms
View on GitHub
☆14Apr 25, 2023Updated 3 years ago
MTSWebServices / onetl
View on GitHub
One ETL tool to rule them all
☆89Updated this week
Gsonggit / stockwell_transform
View on GitHub
origin S_transform matlab code transfered to Python
☆20Mar 22, 2019Updated 7 years ago
angang-li / sparkify
View on GitHub
Predict churn with Apache Spark
☆12Feb 2, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JQVeenstra / arfima
View on GitHub
Now updated prior to the version on CRAN.
☆15Jan 9, 2024Updated 2 years ago
Nithin0001 / Electricity-Bill-Management-System
View on GitHub
DBMS project on Electricity Bill Management
☆27Apr 5, 2022Updated 4 years ago
ArbenKqiku / LinkedInRAds
View on GitHub
☆14Oct 25, 2020Updated 5 years ago
josephmachado / data_engineering_best_practices_log
View on GitHub
Code to demonstrate data engineering metadata & logging best practices
☆22Mar 12, 2024Updated 2 years ago
JamesRaynard / Update-Modern-Cpp
View on GitHub
The source code for my Udemy course "Update to Modern C++"
☆14Apr 16, 2026Updated 3 months ago
romulovieira777 / Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark
View on GitHub
☆13Feb 18, 2022Updated 4 years ago
PastorGL / datacooker-etl
View on GitHub
ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included
☆16Jun 12, 2026Updated last month
ashishrpandey / edyodapipelinedemo
View on GitHub
A maven based java project
☆12Mar 20, 2022Updated 4 years ago
aiwithqasim / pyspark_bigdata
View on GitHub
Getting started with PySpark for Big data analysis
☆10Aug 24, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
josephmachado / socialetl
View on GitHub
Project for "Data pipeline design patterns" blog.
☆53Aug 6, 2024Updated last year
OtusTeam / data-engineer
View on GitHub
☆12May 19, 2021Updated 5 years ago
TheoViel / kaggle_contrails
View on GitHub
2nd Place Solution for the Google Research - Identify Contrails to Reduce Global Warming Competition
☆14Aug 15, 2023Updated 2 years ago
benjaminjost / elastic-siem
View on GitHub
Elastic SIEM template for docker
☆19Oct 6, 2021Updated 4 years ago
obulygin / pyda_homeworks
View on GitHub
☆15May 7, 2025Updated last year
nigelpoulton / dockercon2023-wasm-lab
View on GitHub
Lab instructions for Wasm lab at DockerCon 2023
☆21Oct 16, 2023Updated 2 years ago
moritzkoerber / covid-19-data-engineering-pipeline
View on GitHub
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…
☆24Nov 21, 2023Updated 2 years ago
gjustin40 / Pytorch-Cookbook
View on GitHub
Practice Pytorch
☆10Feb 14, 2023Updated 3 years ago
avensolutions / spark-sql-etl-framework
View on GitHub
Multi-stage, config driven, SQL based ETL framework using PySpark
☆26Sep 16, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Nischaydnk / HubMap-2023-3rd-Place-Solution
View on GitHub
Winning 3rd Place solution for HubMap - Hacking the Human Vasculature hosted on Kaggle
☆15Aug 10, 2023Updated 2 years ago
amelinvladimir / clickhouse_course
View on GitHub
☆17Apr 17, 2026Updated 3 months ago
dimoobraznii1986 / Assignments
View on GitHub
☆16Feb 12, 2025Updated last year
LeadingIndiaAI / Drowsiness-Detection-Using-Facial-Images
View on GitHub
The project focuses on the drowsiness of IT employees, drivers, pilots, crane operators, student etc. These people need a system which ca…
☆14Sep 13, 2018Updated 7 years ago
HROlive / Advanced-Deep-Learning-with-Transformers
View on GitHub
Workshop that will take you from Graph Neural Networks (GNNs) to Transformers, architectures which have led to numerous breakthrough achi…
☆12Sep 11, 2023Updated 2 years ago
rbiswasfc / kaggle-feedback3-efficiency-1st-place
View on GitHub
1st place (Efficiency Track) solution for Feedback Prize - English Language Learning Kaggle competition
☆11Dec 17, 2022Updated 3 years ago
darshilparmar / Udacity-Data-Engineer-nanodegree
View on GitHub
Classwork projects and home works done through Udacity data engineering nano degree
☆10Jun 6, 2021Updated 5 years ago