Column-wise type annotations for pyspark DataFrames
☆107Jun 28, 2026Updated this week
Alternatives and similar repositories for typedspark
Users that are interested in typedspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a comprehensive end-to-end data engineering project. I extracted data directly from YouTube in raw JSON format using Python and A…☆12Jun 4, 2024Updated 2 years ago
- Mirror of Apache DataFu☆124May 18, 2026Updated last month
- ReactiveX for data science☆14Sep 18, 2025Updated 9 months ago
- PySpark test helper methods with beautiful error messages☆771May 20, 2026Updated last month
- Ambari and Cloudera Manager in Docker☆22Mar 7, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Mar 17, 2022Updated 4 years ago
- ✨ A Pydantic to PySpark schema library☆127Jun 23, 2026Updated last week
- Example files used in the DuckDB - Unity Catalog blog☆10Dec 6, 2024Updated last year
- This library contains the Kinesis Analytics stream processing runtime configuration classes.☆11Jan 26, 2026Updated 5 months ago
- code-snippets☆14Apr 9, 2026Updated 2 months ago
- PySpark schema generator☆44Feb 23, 2023Updated 3 years ago
- Edmonds's blossom algorithm for maximum weight matching in undirected graphs☆18Jan 14, 2021Updated 5 years ago
- Custom PySpark Connectors☆100Mar 3, 2026Updated 4 months ago
- Python async data gathering☆11Nov 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆49Oct 15, 2024Updated last year
- textwrap.dedent with t-string support☆24Dec 15, 2025Updated 6 months ago
- A simple plugin to insert the correct shebang of the file.☆11Apr 22, 2017Updated 9 years ago
- library for processing s3select queries and execute them on CSV files (current phase)☆18Jan 5, 2026Updated 5 months ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- ☆12Jun 26, 2023Updated 3 years ago
- Handy Structure Query Language Queries, for a variety of databases☆19Apr 30, 2025Updated last year
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated last year
- Aerospike Provider for Apache Airflow☆19Feb 21, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Evaluation Matrix for Change Data Capture☆25Aug 6, 2024Updated last year
- type-class based data cleansing library for Apache Spark SQL☆79Jun 23, 2019Updated 7 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Nov 29, 2016Updated 9 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, Kafka Stream API and Hazelcast Jet☆10Apr 3, 2024Updated 2 years ago
- Simple script to export 1-to-1 chat history from HipChat☆11Jun 22, 2019Updated 7 years ago
- ☆14Jun 22, 2026Updated last week
- A repository used in a NiFi Registry demo☆13Mar 11, 2020Updated 6 years ago
- ☆15Mar 11, 2020Updated 6 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Comment on files and notebooks in JupyterLab☆17Dec 3, 2021Updated 4 years ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- An SBT Plugin that acts as a light wrapper around Buf.☆10Oct 29, 2024Updated last year
- This is a simple script that parses python files in a directory and generates a mxfile containing a diagramm of classes, attributes and m…☆11Feb 23, 2023Updated 3 years ago
- Linux keyboard mapping utility☆14Nov 25, 2021Updated 4 years ago
- Interactive Data Visualization in JupyterLab☆21Apr 12, 2022Updated 4 years ago
- A simple docker container that runs ssh☆20Apr 6, 2020Updated 6 years ago