PySpark schema generator
☆44Feb 23, 2023Updated 3 years ago
Alternatives and similar repositories for tinsel
Users that are interested in tinsel are comparing it to the libraries listed below.
- A flake8 plugin that detects usage of withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 9 months ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Map your python dataclasses to pyspark types☆10Feb 11, 2024Updated 2 years ago
- A DataOps framework for building a lakehouse.☆56Updated this week
- Code samples, etc. for Databricks☆73Feb 11, 2026Updated last month
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW☆24Mar 3, 2026Updated 2 weeks ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- agogosml is a flexible data processing pipeline that addresses the common need for operationalizing ML models at scale☆34May 3, 2019Updated 6 years ago
- ☆24Jun 23, 2021Updated 4 years ago
- A proof of concept of how to integrate Spark Lineage in Azure Purview☆21Mar 16, 2021Updated 5 years ago
- Delta lake and filesystem helper methods☆50Feb 29, 2024Updated 2 years ago
- ✨ A Pydantic to PySpark schema library☆121Updated this week
- Marshmallow serializer integration with pyspark☆12Dec 29, 2023Updated 2 years ago
- ☆10Mar 16, 2024Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- A Configuration System for Airflow☆16Updated this week
- This is my capstone project for the Data Incubator Data Science Fellowship. This project aims at building a tool that provides visualizat…☆10Nov 6, 2019Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆52Jan 9, 2017Updated 9 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- Grafana plugin to display air conditions on a psychrometric chart.☆14Mar 16, 2026Updated last week
- ☆26Mar 4, 2024Updated 2 years ago
- Repository for the course "Modern Storages and Data Warehousing", PI program, HSE University, 2024☆14Apr 13, 2025Updated 11 months ago
- PySpark test helper methods with beautiful error messages☆755Feb 25, 2026Updated 3 weeks ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- ☆18Apr 20, 2018Updated 7 years ago
- A conda-smithy repository for polars.☆12Updated this week
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 2 months ago
- Python wrapper for lsm1 extension for sqlite4☆15Feb 27, 2025Updated last year
- Library and command-line tool to gather stats on typeshed packages☆12Updated this week
- Framework for simpler Spark Pipelines☆11Mar 6, 2026Updated 2 weeks ago
- A conda-smithy repository for ollama.☆10Updated this week
- Scripts for Azure Synapse SQL Pools (Provisioned) and Query-on-Demand (Serverless)☆11Nov 2, 2021Updated 4 years ago
- Documentation repository for cmake-tools☆11Oct 25, 2023Updated 2 years ago
- Modeling directed acyclic graphs (DAG) for topological sorting, shortest path, longest path, etc.☆14Sep 1, 2017Updated 8 years ago
- Python code that will collapse structured columns separating out the attributes into new columns☆10Mar 15, 2022Updated 4 years ago
- awaits the completion of multiple async tasks☆12Nov 29, 2015Updated 10 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- Visual Studio Code Server on Azure Web App for Containers☆10Apr 12, 2019Updated 6 years ago