This is the first project where we worked on apache spark, In this project what we have done is that we downloaded the datasets from KAGGLE where everyone is aware of, we have downloaded loan, customers credit card and transactions datasets . After downloading the datsaets we have cleaned the data . Then after by using new tools and technologies…
☆22Oct 14, 2021Updated 4 years ago
Alternatives and similar repositories for pyspark-project
Users that are interested in pyspark-project are comparing it to the libraries listed below
Sorting:
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆13Jun 6, 2019Updated 6 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆21Jan 30, 2019Updated 7 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆10Jul 26, 2023Updated 2 years ago
- ☆10May 19, 2022Updated 3 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- Conteúdo das aulas da turma 6 do bootcamp de engenharia de dados da How☆12Sep 16, 2021Updated 4 years ago
- ☆14Feb 20, 2023Updated 3 years ago
- IBGE - Censo 2010 - Localização e respectivo Código de Setor Censitário☆10Apr 3, 2021Updated 4 years ago
- Prediction of Premier League results using Machine Learning☆11Jul 11, 2024Updated last year
- ☆10Mar 14, 2021Updated 4 years ago
- Platzi - Curso Optimización de SQL☆14Jan 13, 2021Updated 5 years ago
- This Guidance helps customers set up an ecommerce website on WordPress.☆11Oct 19, 2024Updated last year
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- SQL☆21Jul 15, 2017Updated 8 years ago
- Github Workflows üzerinde Çalışan A101 Aktüel Telegam Bot☆14Sep 29, 2023Updated 2 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- ☆15Jan 11, 2024Updated 2 years ago
- ☆10May 30, 2021Updated 4 years ago
- Accessibility-ready business WordPress theme.☆15Sep 3, 2025Updated 6 months ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Jul 24, 2023Updated 2 years ago
- WARNING: This repository is no longer maintained The Insights for Twitter service from IBM Cloud has been sunset. This repository will n…☆11Apr 10, 2019Updated 6 years ago
- It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged fo…☆13Jun 12, 2021Updated 4 years ago
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cle…☆15Nov 11, 2024Updated last year
- Um template para criar um FAQ chatbot usando Rasa, Rocket.chat, elastic search☆14Oct 5, 2021Updated 4 years ago
- Script para ingestão de dados do Mercado Bitcoin☆11Jun 29, 2023Updated 2 years ago
- Google Data Studio connector example code☆11Nov 26, 2018Updated 7 years ago
- ☆13Feb 14, 2024Updated 2 years ago
- Lambda serverless workshop☆13Aug 23, 2018Updated 7 years ago
- Simple demo for Databricks!☆14Sep 11, 2023Updated 2 years ago
- R package for loading data from Google Ads API☆15Sep 3, 2025Updated 6 months ago
- Ciência de dados☆12Aug 25, 2022Updated 3 years ago
- This node recovers new rows from Google Sheets sheet☆14Oct 13, 2022Updated 3 years ago
- ☆11Jul 21, 2021Updated 4 years ago
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- Set of various JSON collections (movies, restaurants, recipes, etc) for demos and tutorials☆18Nov 19, 2020Updated 5 years ago
- Automate OpenVPN using AWS EC2 and Python☆13Oct 11, 2025Updated 4 months ago
- Files to Support Class by Thom Ives and Ghaith Sankari and to build examples for textbook☆15Nov 19, 2021Updated 4 years ago