benniehaelen / delta-lake-up-and-runningView external linksLinks
Companion repository for the book 'Delta Lake Up and Running'
☆48Apr 5, 2025Updated 10 months ago
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below
Sorting:
- ☆13Feb 19, 2025Updated 11 months ago
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- Esse repositório contém todos os códigos que desenvolvi durante o curso Leitura e Manipulação de Dados em Python oferecido no meu canal n…☆13Aug 9, 2021Updated 4 years ago
- ☆17Dec 2, 2024Updated last year
- ☆13Feb 15, 2025Updated last year
- Spark operator deployment and usage on OpenShift☆29Nov 25, 2024Updated last year
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- ☆12Nov 26, 2025Updated 2 months ago
- Java implementation of the EbMS 2.0 specification.☆10Feb 10, 2026Updated last week
- Using Selenium and Beautiful Soup to scrape marathon images☆10Feb 21, 2019Updated 6 years ago
- GitHub Copilot Adoption Plan - Workshops - Labs☆18Sep 18, 2025Updated 4 months ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Dec 13, 2024Updated last year
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆14Dec 15, 2025Updated 2 months ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Scan your AI/ML models for problems before you put them into production.☆11Mar 31, 2025Updated 10 months ago
- This project represents a whole process of Anime data collection, preparation, and delivery as a data app, powered by technologies like P…☆10Oct 4, 2022Updated 3 years ago
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago
- Source code for the module "Advanced Statistics" 📊☆10Feb 25, 2019Updated 6 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- Dutch data.☆10Nov 12, 2025Updated 3 months ago
- Limit long text output for a single JupyterLab mime render.☆13Jul 30, 2025Updated 6 months ago
- LUNA: a Framework for Language Understanding and Naturalness Assessment.☆12Sep 9, 2023Updated 2 years ago
- A helper script to ease some of the pain involved in mandatory MFA and Cross-Account Roles using the CLI.☆11Jun 21, 2022Updated 3 years ago
- Command line client for the Fugue API☆14Mar 7, 2023Updated 2 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 3 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆33Nov 21, 2025Updated 2 months ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- Auto-mirror of scoopinstaller/scoop-main bucket☆12Updated this week
- Eclipse MicroProfile based Java Microservice running in Payara Micro☆10Nov 4, 2019Updated 6 years ago
- Easily Deploy Code to AWS Lambda☆13Aug 15, 2018Updated 7 years ago
- Apache NiFi deployment on OpenShift☆13Jul 18, 2023Updated 2 years ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds☆11Oct 28, 2019Updated 6 years ago
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Jul 24, 2023Updated 2 years ago