Delta lake and filesystem helper methods
☆50Feb 29, 2024Updated 2 years ago
Alternatives and similar repositories for jodie
Users that are interested in jodie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Write property based tests easily on spark dataframes☆20Jan 19, 2024Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- ☆13Oct 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated last year
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Delta Lake examples☆239Oct 8, 2024Updated last year
- pyspark methods to enhance developer productivity 📣 👯 🎉☆687Mar 6, 2025Updated last year
- Command line client for the Fugue API☆14Mar 7, 2023Updated 3 years ago
- Delta Lake Documentation☆53Jun 19, 2024Updated last year
- Optics for Spark DataFrames☆47Mar 5, 2021Updated 5 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PySpark test helper methods with beautiful error messages☆759Updated this week
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- PySpark schema generator☆44Feb 23, 2023Updated 3 years ago
- A web application for creating and managing Databricks cluster policies with an interactive UI, allowing users to configure policy attrib…☆17May 7, 2025Updated 11 months ago
- ☆20Jan 17, 2025Updated last year
- Run Apache Airflow on OpenShift☆14Jun 14, 2021Updated 4 years ago
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆98Mar 17, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Firestore with Dart Flutter example☆12Jan 11, 2018Updated 8 years ago
- Type safety for spark columns☆79Oct 27, 2025Updated 5 months ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Code that was used as an example during the Data+AI Summit 2020☆15Mar 8, 2021Updated 5 years ago
- Example Power BI files☆18Sep 17, 2024Updated last year
- A Delta Lake reader for Dask☆54Jul 29, 2025Updated 8 months ago
- Spark Monitoring☆13Feb 28, 2023Updated 3 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆36Jul 9, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Intake examples☆34Jun 2, 2023Updated 2 years ago
- Unity Catalog UI☆43Sep 6, 2024Updated last year
- Kafka Avro (de)serializer using Apache Jackson☆18Jan 23, 2026Updated 2 months ago
- Kafka using Java code. Original article is hosted on Medium in our engineering blog https://medium.com/pharos-production/kafka-using-java…☆10Dec 8, 2024Updated last year
- A small example setting Python's logging configuration using a module invoked from a notebook.☆10May 14, 2023Updated 2 years ago
- A Python library created to easily use the Power BI REST API with Python☆11Aug 19, 2024Updated last year
- A guide for leading a data (engineering) team☆65May 7, 2024Updated last year