Companion repository for the book 'Delta Lake Up and Running'
☆49Apr 5, 2025Updated last year
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 19, 2025Updated last year
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- ☆62Feb 1, 2025Updated last year
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Apr 1, 2025Updated last year
- Analysis code for Schmaelzle, O'Donnell et al. (2017) PNAS☆10May 7, 2017Updated 9 years ago
- Therapixel solution of 2017's kaggle challenge on lung cancer detection☆14Feb 9, 2018Updated 8 years ago
- Implementing CI/CD Using Azure Pipelines, published by Packt☆13Nov 29, 2023Updated 2 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Repository for Databricks And Azure Maps Online Workshop Series☆17Mar 21, 2022Updated 4 years ago
- Using Selenium and Beautiful Soup to scrape marathon images☆10Feb 21, 2019Updated 7 years ago
- ☆30Jul 2, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆17Apr 2, 2024Updated 2 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- ☆64Jan 9, 2024Updated 2 years ago
- ☆12Apr 1, 2026Updated last month
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- ☆12Jul 22, 2025Updated 9 months ago
- GDS Patient Journey Demo☆13Jul 25, 2023Updated 2 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Oct 8, 2022Updated 3 years ago
- Udacity Data Engineering Nanodegree Projects☆11Sep 5, 2019Updated 6 years ago
- Code for dbt tutorial☆174Sep 9, 2025Updated 8 months ago
- ☆21Dec 11, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A Postgres data warehouse for processing synthetic data using IAC principles☆19Feb 27, 2023Updated 3 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 6 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- ☆14Feb 23, 2021Updated 5 years ago
- Run Ollama on Kubernetes☆46Nov 19, 2024Updated last year
- Birds 400-Species Image Classification using Pytorch Metric Learning (Triplet Margin Loss)☆13Nov 1, 2022Updated 3 years ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago