This is the first project where we worked on apache spark, In this project what we have done is that we downloaded the datasets from KAGGLE where everyone is aware of, we have downloaded loan, customers credit card and transactions datasets . After downloading the datsaets we have cleaned the data . Then after by using new tools and technologies…
☆22Oct 14, 2021Updated 4 years ago
Alternatives and similar repositories for pyspark-project
Users that are interested in pyspark-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆13Jun 6, 2019Updated 6 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆21Jan 30, 2019Updated 7 years ago
- An e2e pipeline using dlt, dagster, duckdb, and dbt-core☆20Mar 27, 2026Updated 2 weeks ago
- Github Workflows üzerinde Çalışan A101 Aktüel Telegam Bot☆14Sep 29, 2023Updated 2 years ago
- The Free AWS Certified Cloud Practitioner Study Course☆14Oct 15, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16May 29, 2023Updated 2 years ago
- ☆18Nov 19, 2022Updated 3 years ago
- PySpark data-pipeline testing and CICD☆28Oct 28, 2020Updated 5 years ago
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- StreamSoft enables real-time analysis of any stock market☆15Apr 24, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- ☆10May 26, 2021Updated 4 years ago
- Files to Support Class by Thom Ives and Ghaith Sankari and to build examples for textbook☆15Nov 19, 2021Updated 4 years ago
- ☆21Jun 7, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆11Jul 21, 2021Updated 4 years ago
- ☆10May 30, 2021Updated 4 years ago
- This repo contains all code and data for WWCode Python DE workshop Aug 18 and 25 2022☆25Sep 17, 2022Updated 3 years ago
- ☆14Mar 3, 2021Updated 5 years ago
- Git Repository☆153Jan 9, 2026Updated 3 months ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆17Oct 16, 2019Updated 6 years ago
- Complete SQL + Databases Bootcamp: Zero to Mastery [2020]☆31Sep 29, 2020Updated 5 years ago
- Sample RAG pattern using Azure SQL DB, Langchain and Chainlit☆31Dec 3, 2024Updated last year
- Contains spark dataframe solutions of leetcode questions☆24Dec 13, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Jan 25, 2018Updated 8 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆28Jun 13, 2022Updated 3 years ago
- This tool helps Companies predict accountability of their suppliers and reduces risk involved with Supply Chain Risk Management☆25Apr 2, 2017Updated 9 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- ☆12May 31, 2021Updated 4 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆10Jul 26, 2023Updated 2 years ago
- PySpark Projects☆27Mar 27, 2026Updated 2 weeks ago
- This node recovers new rows from Google Sheets sheet☆14Oct 13, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WARNING: This repository is no longer maintained The Insights for Twitter service from IBM Cloud has been sunset. This repository will n…☆11Apr 10, 2019Updated 7 years ago
- ☆27Apr 26, 2020Updated 5 years ago
- This Guidance helps customers set up an ecommerce website on WordPress.☆12Oct 19, 2024Updated last year
- Talking Google Analytics reports in Shiny☆14Jun 22, 2018Updated 7 years ago
- Google Data Studio connector example code☆11Nov 26, 2018Updated 7 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- The open source version of the Amazon Redshift Getting Started Guide.☆15Jun 15, 2023Updated 2 years ago