dacort / modern-data-lake-storage-layersView external linksLinks
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
☆47Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for modern-data-lake-storage-layers
Users that are interested in modern-data-lake-storage-layers are comparing it to the libraries listed below
Sorting:
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Apr 27, 2025Updated 9 months ago
- ☆16Sep 25, 2023Updated 2 years ago
- This repository provides the resources required for the Amazon Redshift Streaming workshop☆13Jul 12, 2023Updated 2 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- ☆20Jan 19, 2024Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆52Oct 31, 2023Updated 2 years ago
- ☆18Jun 16, 2024Updated last year
- Machine learning enhancements to Spark MlLib☆20Mar 19, 2015Updated 10 years ago
- A Caddy server module that provides a REST API for DuckDB database operations with built-in authentication and authorization.☆77Nov 27, 2025Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Feb 1, 2026Updated 2 weeks ago
- Learn How To Observe, Manage, and Scale, Agentic AI Apps Using Azure AI Foundry - with this hands-on workshop☆39Feb 5, 2026Updated last week
- Houses the proto files and examples for Fivetran's Partner SDK☆14Feb 5, 2026Updated last week
- asw.cluster R package for calculating group faultlines☆12Aug 20, 2023Updated 2 years ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.☆15May 23, 2024Updated last year
- Speech ANDroid Apps☆20Jan 22, 2014Updated 12 years ago
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated 11 months ago
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…☆12Aug 20, 2024Updated last year
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker☆11Jul 18, 2017Updated 8 years ago
- Short course on subsurface data analytics and machine learning.☆10May 18, 2019Updated 6 years ago
- Making the transition from Scratch to Python☆11Apr 11, 2017Updated 8 years ago
- [DEPRECATED] Template for setting up a Gardener landscape using landscape-setup☆16Mar 7, 2020Updated 5 years ago
- This UE4 project contains the Telekinesis Mechanic for Control☆11Jul 26, 2020Updated 5 years ago
- Pair Trading Analysis & Exercises Toolkit [Jupyter Notebook]☆12Nov 3, 2023Updated 2 years ago
- Streamlit application to explore Snowflake Tables☆49Oct 28, 2023Updated 2 years ago
- This project uses PySpark and Python to analyze a Google Play Store dataset. It covers data cleaning, duplicate removal, and visual analy…☆12Apr 6, 2022Updated 3 years ago
- Source code from Billy Newports blog☆19Nov 24, 2014Updated 11 years ago
- Proof of concept of a big data cluster using open source tools☆11Apr 10, 2024Updated last year
- ☆11Mar 3, 2024Updated last year
- This project aims at giving the best customer service ever using the power of LLM models like GPT.☆10Jun 29, 2023Updated 2 years ago
- Analyzing NBA Data☆11Feb 19, 2015Updated 10 years ago
- Build Multi-Account and Multi-VPC AWS network infrastructure with Network Shared Services (NSS)☆11Apr 28, 2025Updated 9 months ago
- These scripts clean the unused EBS volumes, AMIs and snapshots on Amazon Web Services.☆11Jul 24, 2015Updated 10 years ago
- "유닉스 리눅스 셸 스크립트 예제 사전: Unix & Linux Shell Script Exercise Dictionary" - 한빛미디어☆10Jan 17, 2017Updated 9 years ago
- ☆15Apr 4, 2021Updated 4 years ago
- Leveraging Apache CTakes and Azure Search to Build and Medical Search App☆11May 14, 2019Updated 6 years ago
- ☆19Dec 1, 2025Updated 2 months ago
- This is a demo for HTTP MCP Server written in Python☆18Jul 30, 2025Updated 6 months ago
- Examples from Rob's Awesome Python Template☆14Updated this week