spetlr-org / spetlr
A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW
☆21Updated last month
Alternatives and similar repositories for spetlr:
Users that are interested in spetlr are comparing it to the libraries listed below
- Streaming demo dbt☆17Updated 7 months ago
- ☆30Updated 10 months ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆27Updated 3 months ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆43Updated 9 months ago
- Code samples, etc. for Databricks☆64Updated last month
- Custom PySpark Data Sources☆50Updated last week
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 3 months ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆23Updated last year
- Delta lake and filesystem helper methods☆51Updated last year
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- ML Ops Accelerator: Databricks & Azure Machine Learning Unification☆74Updated 9 months ago
- An end-to-end Recommendation System built on Azure Databricks☆53Updated 5 years ago
- A simple VS Code devcontainer setup for local PySpark development☆50Updated last year
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆152Updated 8 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10Updated last year
- ☆40Updated 3 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆32Updated 9 months ago
- devops-for-databricks☆61Updated 10 months ago
- dbt adapter for Azure Synapse Dedicated SQL Pools☆71Updated 2 weeks ago
- Prescriptive guidance for building, deploying, and monitoring machine learning models with Azure Databricks using containers in line with…☆23Updated this week
- MLOps using Azure Databricks, Azure DevOps and Azure ML Services☆56Updated 4 years ago
- ☆10Updated 3 years ago
- Delta Lake helper methods. No Spark dependency.☆23Updated 7 months ago
- Example code for doing DataOps☆47Updated 4 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆50Updated last week
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆27Updated 6 months ago
- Guided accelerator consolidating best practice patterns, IaaC and AML code artefacts to provide a reference approach to implementing MLOp…☆47Updated last year
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆32Updated 3 months ago
- Examples surrounding Databricks.☆58Updated 10 months ago
- Stream Data from Databricks Directly to PowerBI, and CosmosDB!☆12Updated 6 years ago