SemyonSinchenko / flake8-pyspark-with-columnView external linksLinks
A flake8 plugin that detects of usage withColumn in a loop or inside reduce
☆28Jun 20, 2025Updated 7 months ago
Alternatives and similar repositories for flake8-pyspark-with-column
Users that are interested in flake8-pyspark-with-column are comparing it to the libraries listed below
Sorting:
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated 11 months ago
- PySpark schema generator☆44Feb 23, 2023Updated 2 years ago
- ☆16Oct 17, 2024Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- Tools for Microsoft Fabric☆24Jul 17, 2025Updated 6 months ago
- ⛅ Run OpenVSCode Server in Google Cloud Shell☆11Dec 22, 2023Updated 2 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆78Apr 27, 2025Updated 9 months ago
- Delta Lake helper methods in PySpark☆327Jan 19, 2026Updated 3 weeks ago
- ☆10Jul 1, 2022Updated 3 years ago
- Implementation of core-expansion algorithm☆11Jan 26, 2026Updated 2 weeks ago
- ☆10Aug 23, 2023Updated 2 years ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- The ONS Big Data Team Github pages☆10May 19, 2021Updated 4 years ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- See Apache Kylin Website for a complete description☆30May 28, 2018Updated 7 years ago
- The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.☆15May 22, 2024Updated last year
- Single node Cloudera environment in docker☆10Jan 16, 2016Updated 10 years ago
- Cl app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Mar 3, 2025Updated 11 months ago
- A toolkit of functions and classes to help build isometric games with Lua☆16Apr 21, 2025Updated 9 months ago
- Examples of Selenium in Python☆11Jun 11, 2018Updated 7 years ago
- ☆13Updated this week
- ☆11Jan 28, 2019Updated 7 years ago
- Reproducible Analytical Pipeline of the Hospital Standardised Mortality Ratio (HSMR) quarterly publication☆11Jun 21, 2024Updated last year
- Reproducible Research in Finse☆10Aug 5, 2020Updated 5 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- TBD☆10Oct 30, 2015Updated 10 years ago
- freakin' simple yo api wrapper for nodejs☆17Dec 18, 2014Updated 11 years ago
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs <<☆14Oct 2, 2023Updated 2 years ago
- 資料匯入的程式碼參考☆10Oct 13, 2016Updated 9 years ago
- Simple examples showing how to use ADBC with various databases, query engines, and data platforms☆37Jan 28, 2026Updated 2 weeks ago
- Records the execution of .NET programs, to create scenarios in AppMap files☆12Jul 25, 2024Updated last year
- Embedded Linux☆11Jul 11, 2024Updated last year
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆14Dec 12, 2025Updated 2 months ago
- Discord bot used for managing an AWS instance including toggling on or off☆11Mar 27, 2024Updated last year
- ☆10Sep 14, 2018Updated 7 years ago
- ☆20Jan 31, 2026Updated 2 weeks ago
- Helpful user defined fuctions / table generating functions for Hive☆101May 2, 2016Updated 9 years ago
- ☆11Mar 1, 2024Updated last year
- Lazily initialized ASGI apps☆12Jan 21, 2025Updated last year