This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.
☆22Oct 15, 2024Updated last year
Alternatives and similar repositories for databricks-lakehouse
Users who are interested in databricks-lakehouse are comparing it to the libraries listed below.
- SCD Merge Wizard is an application which will help you generate T-SQL statement for merging data from two tables into one table in minute…☆44Sep 4, 2024Updated last year
- Python files for working with the pNEUMA dataset☆22May 18, 2021Updated 5 years ago
- StarSnow: HTTP Client for Snowflake database (HTTP get/post from SQL)☆26Oct 6, 2022Updated 3 years ago
- A benchmark tool for lakehouses.☆14Mar 12, 2023Updated 3 years ago
- Property Casualty Data Model Specification☆36Jun 22, 2022Updated 4 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Feb 21, 2026Updated 4 months ago
- Collect NBA injuries report, organize them in an elegant table, then send it via mail☆10Jan 12, 2021Updated 5 years ago
- A comprehensive set of calendar table value functions, for use in calendar dimensions or other applications.☆13Sep 10, 2020Updated 5 years ago
- A modern relational spreadsheet 🌈☆51Mar 3, 2023Updated 3 years ago
- This repo will serve to simplify the steps to generate a compatible zip archive for creating AWS Lambda layers. With some additional chan…☆12Mar 18, 2024Updated 2 years ago
- Database reverse engineering☆50Nov 1, 2023Updated 2 years ago
- ☆10Jan 28, 2025Updated last year
- All sorts of things supporting blog posts... Sub folders per blog post title.☆40Jan 30, 2023Updated 3 years ago
- ☆12Mar 15, 2024Updated 2 years ago
- [MOVED to Data Engine Thinking] A library for data warehouse and data integration pattern and architecture documentation.☆51Jan 30, 2026Updated 5 months ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Jan 19, 2020Updated 6 years ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli.☆13Sep 7, 2023Updated 2 years ago
- ☆58Dec 1, 2023Updated 2 years ago
- Scala library for parsing fixed length file format☆13Oct 19, 2021Updated 4 years ago
- End-to-end examples that show how to solve business problems using Amazon SageMaker and its ML/DL algorithms.☆17Jun 12, 2023Updated 3 years ago
- Python wrapper for the collegefootballapi☆14May 22, 2023Updated 3 years ago
- F# interactive service API exposed via a lightweight HTTP server☆13Mar 6, 2018Updated 8 years ago
- ☆17Oct 29, 2020Updated 5 years ago
- Data lake, data warehouse on GCP☆58Dec 28, 2021Updated 4 years ago
- Repo for all sports code and research papers☆14Jun 7, 2026Updated 3 weeks ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 4 years ago
- ☆11Dec 6, 2022Updated 3 years ago
- Automatic Feature Engineering for Time Series☆18Jan 2, 2026Updated 6 months ago
- Databricks CI/CD using Azure DevOps☆21Nov 1, 2022Updated 3 years ago
- Autoregressive Bayesian linear model☆21Sep 10, 2020Updated 5 years ago
- ☆13Jul 8, 2025Updated 11 months ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.☆16Sep 3, 2021Updated 4 years ago
- This is a sample project that demonstrates a data ETL (Extract, Transform, Load) process using Python, Polars and AWS Loca…☆15Sep 25, 2023Updated 2 years ago
- Supplemental Scripts and Code for Angular for Enterprise Ready Web Apps☆10May 21, 2018Updated 8 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Nov 21, 2023Updated 2 years ago
- An idiomatic Scala wrapper around the AWS Java SDK☆22Dec 23, 2021Updated 4 years ago
- ☆69Jun 21, 2026Updated last week
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆29Feb 14, 2026Updated 4 months ago
- Ravi Azure ADB ADF Repository☆65Jan 25, 2025Updated last year