ayushdixit487/Uber-Data-Analysis-Project-in-Pyspark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ayushdixit487/Uber-Data-Analysis-Project-in-Pyspark)

ayushdixit487 / Uber-Data-Analysis-Project-in-Pyspark

This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.

☆20

Alternatives and similar repositories for Uber-Data-Analysis-Project-in-Pyspark

Users that are interested in Uber-Data-Analysis-Project-in-Pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cordon-thiago / spark-kafka-consumer
View on GitHub
Spark application to consume kafka events generated by a python producer.
☆12Aug 7, 2021Updated 4 years ago
AWS-Big-Data-Projects / big-data-solutions
View on GitHub
This repository provides Code examples written in Python,Spark-Scala using primarily boto3 SDK API methods and aws cli examples for major…
☆14Mar 6, 2022Updated 4 years ago
MarcusElwin / ner-dspy
View on GitHub
Using DSPy for NER tasks using LLMs
☆17Apr 1, 2024Updated 2 years ago
itversity / pyspark
View on GitHub
Repository for Spark using Python material. It is popularly known as PySpark.
☆21Aug 18, 2021Updated 4 years ago
AuFeld / Data_Engineering_Projects
View on GitHub
A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…
☆15Apr 29, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wilson-mok / demo
View on GitHub
In this repository, you will find varies demo and presentations I have delivered throughout the year. This includes the link to the video…
☆15Jun 27, 2026Updated last month
ayushdixit487 / CCDAK-Exam-Practice-Test-2
View on GitHub
Complete high-quality practice tests of 50 questions each will help you master your Confluent Certified Developer for Apache Kafka (CCDAK…
☆90Aug 28, 2023Updated 2 years ago
arverma / TowardsDataEngineering
View on GitHub
This repo contains commands that data engineers use in day to day work.
☆64Feb 4, 2023Updated 3 years ago
Rohanvp07 / Covid-19-Analysis-and-Prediction
View on GitHub
☆16Feb 20, 2026Updated 5 months ago
AR6420 / Hail_Hydra
View on GitHub
🐉 Hail Hydra — Multi-headed speculative execution framework for Claude Code. 10 AI agents, 3x faster, ~70% cheaper. Inspired by speculat…
☆45Jun 11, 2026Updated last month
xenodium / xenodium.github.io
View on GitHub
☆11Jul 22, 2026Updated last week
Ashraf1395 / supply_chain_finance
View on GitHub
☆17Apr 19, 2024Updated 2 years ago
damklis / etljob
View on GitHub
Simple ETL pipeline using Python
☆29May 22, 2023Updated 3 years ago
jmcmt87 / spark_app_twitter
View on GitHub
A data engineering project (Twitter monitor app)
☆87Jun 27, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhenhua-wang / emacs.d
View on GitHub
My Emacs Config
☆14Jul 22, 2026Updated last week
tumashu / pyim-wbdict
View on GitHub
Wubi dicts for pyim
☆14Jan 13, 2023Updated 3 years ago
yitang / .emacs.d
View on GitHub
Emacs configuration
☆15May 1, 2026Updated 2 months ago
glen-dai / highlight-global
View on GitHub
A highlight package for EMACS across all buffers/files.
☆15Nov 5, 2015Updated 10 years ago
souvik131 / exposure
View on GitHub
☆12Jun 18, 2024Updated 2 years ago
akshatgadodia / django-docker-starter-project
View on GitHub
☆14Sep 13, 2024Updated last year
hoe94 / DTC_MLOPS_Project
View on GitHub
This is the end to end MLOps project I built through participated the MLOps Zoomcamp
☆10Sep 11, 2022Updated 3 years ago
martandsingh / ApacheSpark
View on GitHub
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…
☆105Sep 26, 2025Updated 10 months ago
FareedKhan-dev / AI-outlier-detection
View on GitHub
Outlier Detection with AI + ML
☆15Sep 12, 2025Updated 10 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
LeadingIndiaAI / Drowsiness-Detection-Using-Facial-Images
View on GitHub
The project focuses on the drowsiness of IT employees, drivers, pilots, crane operators, student etc. These people need a system which ca…
☆14Sep 13, 2018Updated 7 years ago
gregnewman / gmacs
View on GitHub
My emacs configuration for org-mode, python, javascript and react development.
☆16Jul 7, 2026Updated 3 weeks ago
muraliprajapati / watchlistpro
View on GitHub
Advanced TradingView like watchlists for Zerodha Kite
☆12Jun 27, 2026Updated last month
darshilparmar / Udacity-Data-Engineer-nanodegree
View on GitHub
Classwork projects and home works done through Udacity data engineering nano degree
☆10Jun 6, 2021Updated 5 years ago
thomst / django-admin-filter
View on GitHub
Django-admin-filter is a generic form-based filter for the django-admin-page.
☆17Sep 2, 2022Updated 3 years ago
Stefen-Taime / modern-data-pipeline
View on GitHub
reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.
☆15Jun 26, 2023Updated 3 years ago
angelddaz / de-challenges
View on GitHub
Project based learning for Data Engineering fundamentals.
☆13Jan 15, 2021Updated 5 years ago
pyplanex / django_with_data_science
View on GitHub
Project on how to integrate django with data science libraries (i.e. pandas, matplotlib, numpy)
☆14Jul 6, 2023Updated 3 years ago
itversity / spark-sql
View on GitHub
Apache Spark using SQL
☆14Aug 18, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Wittline / recommendation-system
View on GitHub
Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)
☆15Jun 13, 2022Updated 4 years ago
Innova-Group-LLC / custom_admin
View on GitHub
A custom admin interface providing backend via DRF and frontend via Vue 2 and Element UI.
☆19Oct 3, 2024Updated last year
AveryData / airbnbanalytics
View on GitHub
☆11Aug 11, 2022Updated 3 years ago
airscholar / Japan-visa-data-engineering
View on GitHub
This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…
☆11Oct 11, 2023Updated 2 years ago
ketaketish / GhostMouse
View on GitHub
GhostMouse is a lightweight program that allows you to record your mouse movement and clicks and plays them back.
☆13Feb 22, 2019Updated 7 years ago
neelabhsinha / Drowsiness-Detection-in-Drivers-using-Deep-Learning
View on GitHub
This repository contains the files related to the project on frame-by-frame Drowsiness Detection in Drivers in videos using facial featur…
☆12Jul 22, 2024Updated 2 years ago
kb22 / NASA-data-exploration
View on GitHub
The repository includes detailed steps to get data from GES DISC, convert HDF5 files to CSV and plotting geographic data.
☆11Aug 17, 2020Updated 5 years ago