treeverse / lakeFS-samples
lakefs-samples repository
☆83 Updated this week
Alternatives and similar repositories for lakeFS-samples
Users who are interested in lakeFS-samples compare it to the repositories listed below.
- Data engineering with dbt, published by Packt ☆79 Updated last year
- Delta Lake Documentation ☆49 Updated last year
- Delta Lake examples ☆225 Updated 8 months ago
- Full stack data engineering tools and infrastructure set-up ☆53 Updated 4 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆254 Updated 4 months ago
- New generation open-source data stack ☆69 Updated 3 years ago
- Quick guides from Dremio on several topics ☆71 Updated this week
- Unity Catalog UI ☆40 Updated 9 months ago
- ☆10 Updated 3 years ago
- A table-format-agnostic data sharing framework ☆38 Updated last year
- ☆90 Updated 5 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆195 Updated this week
- A DataOps framework for building a lakehouse. ☆50 Updated this week
- Official Dockerfile for Delta Lake ☆53 Updated last year
- The go-to demo for public and private dbt Learn ☆77 Updated 3 months ago
- ☆43 Updated 4 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆70 Updated 9 months ago
- PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them ☆69 Updated 2 months ago
- Delta Lake helper methods in PySpark ☆326 Updated 9 months ago
- Demo of using Nutter for testing Databricks notebooks in a CI/CD pipeline ☆152 Updated 10 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆84 Updated 2 years ago
- ☆30 Updated 11 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆142 Updated 11 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆83 Updated last week
- Demonstration of using Files in Repos with Databricks Delta Live Tables ☆33 Updated 11 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ☆118 Updated 2 weeks ago
- A Python library to support running data quality rules while the Spark job is running ⚡ ☆188 Updated this week
- Terraform templates for deploying mage-ai to AWS, GCP and Azure ☆44 Updated last year
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 10 months ago