Apache (Py)Spark type annotations (stub files).
☆118Aug 17, 2022Updated 3 years ago
Alternatives and similar repositories for pyspark-stubs
Users that are interested in pyspark-stubs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Asynchronous actions for PySpark☆48Dec 2, 2021Updated 4 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- ☆16May 31, 2017Updated 8 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆685Mar 6, 2025Updated last year
- Storm Database Explorer - Developing Data Products course project.☆11May 3, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- Helpers & syntactic sugar for PySpark.☆62Dec 4, 2025Updated 3 months ago
- Mirror of Apache Toree (Incubating)☆749Updated this week
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 6 months ago
- pytest plugin to run the tests with support of pyspark☆88May 21, 2025Updated 10 months ago
- Spark style guide☆272Sep 30, 2024Updated last year
- Material for the Jupytext+Papermill blog post☆31Jun 30, 2020Updated 5 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,539Dec 2, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Spark functions to run popular phonetic and string matching algorithms☆59Feb 22, 2022Updated 4 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 9 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 3 months ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆154Jul 31, 2020Updated 5 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 4 years ago
- Real-world Spark pipelines examples☆83Feb 27, 2018Updated 8 years ago
- CLI Based Browser for S3 Buckets☆14Aug 12, 2016Updated 9 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆818Mar 4, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 4 years ago
- Apache Spark Website☆134Updated this week
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- Apache Spark on Kubernetes☆19Mar 19, 2017Updated 9 years ago
- sparkql: Apache Spark SQL DataFrame schema management for sensible humans☆12Sep 18, 2023Updated 2 years ago
- Postgres extension drivers for quill☆15Oct 31, 2016Updated 9 years ago
- A curated list of awesome Apache Spark packages and resources.☆1,866Feb 27, 2026Updated last month
- Parametrize and run scripts as notebooks with jupytext and papermill☆18Sep 29, 2019Updated 6 years ago
- Dataclass with data validation. Checks the value of its fields by their annotations.☆13Jan 7, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Redis search and indexing in Java☆16Sep 26, 2016Updated 9 years ago
- A simple example for PySpark based project.☆11Jun 3, 2016Updated 9 years ago
- A connector for SingleStore and Spark☆162Sep 24, 2025Updated 6 months ago
- Introductory interactive Jupyter tutorial providing details about ORMs in order to assist in the teaching of their use to computing scien…☆14Oct 21, 2025Updated 5 months ago
- CLI tool to launch Spark jobs on AWS EMR☆67Oct 18, 2023Updated 2 years ago
- A client for the Confluent Schema Registry API implemented in Python☆53Mar 18, 2023Updated 3 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago