This project uses PySpark and Python to analyze a Google Play Store dataset. It covers data cleaning, duplicate removal, and visual analysis, performed in Jupyter Notebook with Spark's distributed computing.
☆12Apr 6, 2022Updated 3 years ago
Alternatives and similar repositories for Big-Data-Analytics-and-Visualization-Using-PySpark
Users that are interested in Big-Data-Analytics-and-Visualization-Using-PySpark are comparing it to the libraries listed below
Sorting:
- asw.cluster R package for calculating group faultlines☆12Aug 20, 2023Updated 2 years ago
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated 11 months ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.☆15May 23, 2024Updated last year
- ChatTube: A Retrieval QA System to Youtube Videos☆10Jun 6, 2023Updated 2 years ago
- This Repository contains the Wine Quality Test Project.☆10Jul 12, 2020Updated 5 years ago
- This UE4 project contains the Telekinesis Mechanic for Control☆11Jul 26, 2020Updated 5 years ago
- Pair Trading Analysis & Exercises Toolkit [Jupyter Notebook]☆12Nov 3, 2023Updated 2 years ago
- My implemention of Hidden Markov Model(HMM) and Conditional Random Field(CRF) for Part of Speech tagging in python 3.6☆11Nov 7, 2018Updated 7 years ago
- ☆11Feb 6, 2018Updated 8 years ago
- 哈工大机器学习作业一——多项式拟合曲线☆10Oct 19, 2016Updated 9 years ago
- The Kingdom Hearts 3 randomizer and garden of assemblage mod.☆10Jan 3, 2023Updated 3 years ago
- ☆12Nov 1, 2023Updated 2 years ago
- Unofficial Reproduction: Capacity estimation of lithium-ion batteries based on adaptive empirical wavelet transform and long short-term m…☆12Oct 28, 2024Updated last year
- Production-ready Chainlit RAG application with Pinecone pipeline offering all Groq and OpenAI Models, to chat with your documents.☆11Aug 19, 2025Updated 6 months ago
- Implement Fluid Simulation (FLIP) on Unreal Engine 5 with NVIDIA GVDB Library☆12Nov 30, 2023Updated 2 years ago
- This project aims at giving the best customer service ever using the power of LLM models like GPT.☆10Jun 29, 2023Updated 2 years ago
- The pathfinding implementation in the Unreal Engine 5☆12Dec 17, 2023Updated 2 years ago
- Notebooks exploring the Canadian Institute of Cybersecurity's IoT dataset.☆11Mar 6, 2024Updated last year
- Rope collision in cpp☆12Jun 2, 2025Updated 9 months ago
- Code for Learning idiolectal style variation in online register☆10May 18, 2023Updated 2 years ago
- A remaster of Monolith's 1997 Captain Claw game using Unreal Engine 4.26.2 written in C++ and focusses on Object Oriented Design.☆12Dec 16, 2022Updated 3 years ago
- Menger Sponge Fractal☆11Jan 8, 2023Updated 3 years ago
- Finetuning Mask2Former on semantic segmentation using custom dataset☆15May 31, 2024Updated last year
- Blockchain and AI are on just about every chief information officers watchlist of game-changing technologies that stand to reshape indust…☆11Dec 10, 2021Updated 4 years ago
- Workflow management system written as a pure Python package and command-line utility. It supports complex workflows modeled as directed- …☆16Jan 28, 2025Updated last year
- Time Series Regression with Python☆11Feb 21, 2024Updated 2 years ago
- ☆14Mar 4, 2014Updated 11 years ago
- 本仓库包含了一系列基于 ROS 的功能包,主要围绕 Piper机械臂以及 Orbbec 深度相机等硬件的应用展开,涵盖了模型描述、控制、感知、语音交互、运动规划以及仿真等多个方面,☆19Jun 28, 2025Updated 8 months ago
- hudi-spark-utilities-plus☆11Jul 29, 2022Updated 3 years ago
- Completed Unreal Engine replay system tutorial (blueprint version)☆13Jun 3, 2018Updated 7 years ago
- Flan T5 LLM fine-tuning, by attaching a regression model last hidden layers activations. Runs on colab with A100 40gb☆13Mar 24, 2023Updated 2 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- A python script to take Growatt RS485 ModBus data, pass it to an MQTT broken, InfluxDB and Grafana☆13Aug 8, 2023Updated 2 years ago
- Implementing isometric 3D effect in a pure 2D environment.☆14Apr 21, 2021Updated 4 years ago
- Bot that tracks new ads at sahibinden.com and notifies telegram channel☆13May 28, 2022Updated 3 years ago
- docker-compose for Red Hat Quay Registry☆12Sep 3, 2022Updated 3 years ago
- AI leetcode interviewer that assesses tech applicants. Built on Langchain and OpenAI APIs. Recruiter-focused and tracks progress and subm…☆15Jun 6, 2023Updated 2 years ago
- A combination of extractive and abstractive text summarization for summarizing long scientific texts☆15Feb 7, 2023Updated 3 years ago
- ☆12Jun 17, 2025Updated 8 months ago