Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
☆47 · Jul 13, 2022 · Updated 3 years ago
Alternatives and similar repositories for modern-data-lake-storage-layers
Users who are interested in modern-data-lake-storage-layers are comparing it to the repositories listed below
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR · ☆17 · Apr 27, 2025 · Updated 10 months ago
- ☆16 · Sep 25, 2023 · Updated 2 years ago
- This repository provides the resources required for the Amazon Redshift Streaming workshop · ☆13 · Jul 12, 2023 · Updated 2 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit · ☆24 · Jun 22, 2014 · Updated 11 years ago
- ☆20 · Jan 19, 2024 · Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. · ☆52 · Oct 31, 2023 · Updated 2 years ago
- ☆18 · Jun 16, 2024 · Updated last year
- A Caddy server module that provides a REST API for DuckDB database operations with built-in authentication and authorization. · ☆78 · Nov 27, 2025 · Updated 3 months ago
- Demos, requested from my trainees and based on my daily work with AWS. · ☆10 · Jun 21, 2022 · Updated 3 years ago
- Learn How To Observe, Manage, and Scale, Agentic AI Apps Using Azure AI Foundry - with this hands-on workshop · ☆39 · Feb 5, 2026 · Updated last month
- Short course on subsurface data analytics and machine learning. · ☆10 · May 18, 2019 · Updated 6 years ago
- A Kivy tutorial for PyOhio 2013 · ☆14 · Apr 30, 2014 · Updated 11 years ago
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc… · ☆12 · Aug 20, 2024 · Updated last year
- ☆11 · Oct 13, 2025 · Updated 4 months ago
- Example lua scripts for ATS ts_lua plugin · ☆12 · Nov 6, 2025 · Updated 4 months ago
- ☆20 · Updated this week
- Speech ANDroid Apps · ☆20 · Jan 22, 2014 · Updated 12 years ago
- [DEPRECATED] Template for setting up a Gardener landscape using landscape-setup · ☆16 · Mar 7, 2020 · Updated 6 years ago
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker · ☆11 · Jul 18, 2017 · Updated 8 years ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends. · ☆15 · May 23, 2024 · Updated last year
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI) · ☆12 · Mar 5, 2025 · Updated last year
- Coursera Machine Learning class examples in Spark · ☆43 · Feb 14, 2014 · Updated 12 years ago
- Document classification with Apache Spark on an American Classic · ☆10 · Sep 25, 2015 · Updated 10 years ago
- Rope collision in cpp