This repo contains live examples for building a Databricks Lakehouse, along with recommended best practices from the field.
☆23 · Oct 15, 2024 · Updated last year
Alternatives and similar repositories for databricks-lakehouse
Users who are interested in databricks-lakehouse are comparing it to the libraries listed below.
- SCD Merge Wizard is an application which will help you generate T-SQL statement for merging data from two tables into one table in minute… ☆44 · Sep 4, 2024 · Updated last year
- Python files for working with the pNEUMA dataset ☆22 · May 18, 2021 · Updated 4 years ago
- StarSnow: HTTP Client for Snowflake database (HTTP get/post from SQL) ☆26 · Oct 6, 2022 · Updated 3 years ago
- ☆29 · Aug 17, 2018 · Updated 7 years ago
- A benchmark tool for lakehouses. ☆14 · Mar 12, 2023 · Updated 3 years ago
- Python-based Wikidata framework for easy dataframe extraction ☆45 · Feb 21, 2026 · Updated last month
- Collects NBA injury reports, organizes them in an elegant table, then sends them by email ☆10 · Jan 12, 2021 · Updated 5 years ago
- A comprehensive set of calendar table value functions, for use in calendar dimensions or other applications. ☆13 · Sep 10, 2020 · Updated 5 years ago
- PredictHQ’s Data Science documentation ☆14 · Apr 1, 2026 · Updated last week
- A modern relational spreadsheet 🌈 ☆51 · Mar 3, 2023 · Updated 3 years ago
- ☆17 · Apr 8, 2023 · Updated 3 years ago
- Database reverse engineering ☆51 · Nov 1, 2023 · Updated 2 years ago
- ☆10 · Jan 28, 2025 · Updated last year
- All sorts of things supporting blog posts... Sub folders per blog post title. ☆40 · Jan 30, 2023 · Updated 3 years ago
- Code snippets and tools published on the blog at lifearounddata.com ☆12 · Jan 19, 2020 · Updated 6 years ago
- This is a public repository that the dbt proserv team uses for collective demos. ☆15 · Mar 20, 2026 · Updated 3 weeks ago
- ☆57 · Dec 1, 2023 · Updated 2 years ago
- Scala library for parsing fixed length file format ☆13 · Oct 19, 2021 · Updated 4 years ago
- ☆21 · Feb 27, 2026 · Updated last month
- End-to-End examples that show how to solve business problems using Amazon SageMaker and its ML/DL algorithms. ☆17 · Jun 12, 2023 · Updated 2 years ago
- F# interactive service API exposed via a lightweight HTTP server ☆13 · Mar 6, 2018 · Updated 8 years ago
- Git Repo for EDW Best Practice Assets on the Lakehouse ☆16 · Dec 11, 2023 · Updated 2 years ago
- Resources for the demo of First Order Motion Model for Image Animation ☆17 · Dec 8, 2022 · Updated 3 years ago
- Resources for the Udemy Course - Azure Databricks & Spark Core For Data Engineers (Python/SQL) by Ramesh Retnasamy ☆33 · Aug 23, 2024 · Updated last year
- A Docker setup using Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop) ☆12 · May 2, 2021 · Updated 4 years ago
- Data lake, data warehouse on GCP ☆58 · Dec 28, 2021 · Updated 4 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino ☆22 · May 30, 2022 · Updated 3 years ago
- Databricks CI/CD using Azure DevOps ☆21 · Nov 1, 2022 · Updated 3 years ago
- ☆13 · Jul 8, 2025 · Updated 9 months ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab. ☆15 · Sep 3, 2021 · Updated 4 years ago
- Code files for Mastering JBoss Drools 6, published by Packt ☆11 · Sep 12, 2023 · Updated 2 years ago
- Supplemental Scripts and Code for Angular for Enterprise Ready Web Apps ☆10 · May 21, 2018 · Updated 7 years ago
- ☆70 · Mar 1, 2026 · Updated last month
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud… ☆29 · Feb 14, 2026 · Updated 2 months ago
- Bypass Google Recaptcha V2 using undetected driver with proxy rotator. ☆17 · Nov 30, 2023 · Updated 2 years ago
- Agentic Architectural Patterns for Building Multi-Agent Systems, published by Packt ☆76 · Mar 2, 2026 · Updated last month
- A comprehensive list of annotated training datasets classified by use case. ☆38 · Jul 8, 2022 · Updated 3 years ago
- Ravi Azure ADB ADF Repository ☆65 · Jan 25, 2025 · Updated last year
- A simple cookiecutter template for Scrapy projects ☆20 · Dec 14, 2022 · Updated 3 years ago