JuanARojasA / dataflat
☆11Updated 2 months ago
Alternatives and similar repositories for dataflat:
Users that are interested in dataflat are comparing it to the libraries listed below
- ☆12Updated last year
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆12Updated 4 years ago
- ☆15Updated 3 years ago
- Writing PySpark logs in Apache Spark and Databricks☆16Updated 2 years ago
- ☆26Updated last month
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- Run Apache Airflow on OpenShift☆14Updated 3 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 10 months ago
- ☆34Updated 11 months ago
- ☆16Updated 8 months ago
- Powershell Scripts for Power BI☆12Updated last year
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Updated 4 months ago
- RAG application (backend & frontend) with sources retriveal and highlighting on the Databricks Platform☆10Updated last week
- ☆13Updated last year
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆48Updated last week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- A Streamlit app that provides insights on your Snowflake account usage.☆59Updated 2 months ago
- Companion repository for the book 'Delta Lake Up and Running'☆46Updated 2 weeks ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆78Updated last month
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- Examples surrounding Databricks.☆58Updated 9 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- Spark app to merge different schemas☆23Updated 4 years ago
- Data Engineering with Databricks Cookbook, published by Packt☆80Updated 10 months ago
- Spark runtime on AWS Lambda☆107Updated 7 months ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆23Updated last year
- Hands-On Data Warehousing with Azure Data Factory, published by Packt☆13Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆23Updated 7 months ago
- ☆181Updated 4 years ago