Data for the `Data Analysis with Python and PySpark` book
☆42Jan 9, 2023Updated 3 years ago
Alternatives and similar repositories for DataAnalysisWithPythonAndPySpark-Data
Users that are interested in DataAnalysisWithPythonAndPySpark-Data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for the "PySpark in Action" book☆218Jun 11, 2025Updated 11 months ago
- Data Wrangling with Python 3.x, published by Packt☆20Jan 30, 2023Updated 3 years ago
- ☆11Oct 6, 2023Updated 2 years ago
- Presentation materials from community (public) presentations☆11May 6, 2021Updated 5 years ago
- ☆11Apr 27, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- California Health & Human Services Agency Data Playbook☆14Sep 30, 2020Updated 5 years ago
- Databricks Certified Associate Developer for Apache Spark Using Python, Published by Packt☆12Jun 13, 2024Updated last year
- Python☆15Oct 27, 2023Updated 2 years ago
- Databricks CI/CD using Azure DevOps☆21Nov 1, 2022Updated 3 years ago
- This project have the sample programs for the Azure Databricks technical enablement workshop!☆12Jul 25, 2019Updated 6 years ago
- ☆17Mar 2, 2026Updated 2 months ago
- GitHub Action that installs Databricks CLI☆14Sep 22, 2021Updated 4 years ago
- Python Essentials for AWS Cloud Developers, published by Packt.☆12Apr 27, 2023Updated 3 years ago
- A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight☆14Jun 4, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Limitless Analytics with Azure Synapse, published by Packt☆13Mar 2, 2026Updated 2 months ago
- ☆14Jul 26, 2022Updated 3 years ago
- A tutorial on some useful Pandas features☆16Aug 18, 2018Updated 7 years ago
- ☆10May 5, 2022Updated 4 years ago
- Example Power BI files☆18Sep 17, 2024Updated last year
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and a Azure Logi…☆13Mar 8, 2020Updated 6 years ago
- This repository is used to demonstrate Tabular Editor integration with GitHub Actions for Power BI or Analysis Services CI/CD☆25Mar 20, 2023Updated 3 years ago
- ☆95Sep 14, 2022Updated 3 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Mar 13, 2021Updated 5 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆12Oct 16, 2024Updated last year
- Examples of metadata driven SQL processes implemented in Databricks☆16May 21, 2021Updated 4 years ago
- Azure for Architect - Second Edition, published by Packt☆25May 7, 2019Updated 7 years ago
- Code for utilising VAE as means of doing exact MCMC inference in complex high-dimensional space☆14Jun 20, 2023Updated 2 years ago
- Optimizing Databricks Workload, published by Packt☆18Apr 22, 2026Updated 3 weeks ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- ☆22Jul 14, 2020Updated 5 years ago
- ☆21Feb 6, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆22Apr 24, 2019Updated 7 years ago
- ☆30Jul 2, 2024Updated last year
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- A Model Completion Protocol (MCP) server for interacting with Databricks services☆45Mar 23, 2025Updated last year
- Spark Databricks Notebooks☆15Dec 19, 2020Updated 5 years ago
- Principal Geodesic Analysis in the Wasserstein space☆10Jun 19, 2018Updated 7 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago