This repository demonstrates SQL-based UPDATE, DELETE, and INSERT operations directly in a data lake using Amazon S3, AWS Glue, and Delta Lake.
☆18 · Aug 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for aws-glue-delta-lake
Users that are interested in aws-glue-delta-lake are comparing it to the libraries listed below.
- The Automated Account Configuration is a sample solution to enable operational scale for AWS customers by automating repeatable steps req… ☆14 · Jul 26, 2023 · Updated 2 years ago
- Dask on ECS Fargate ☆14 · Sep 23, 2019 · Updated 6 years ago
- A Wordpress AWS install controlled by Terraform 0.11.x ☆11 · Apr 19, 2026 · Updated last month
- Source code for 'Up and Running with DAX for Power BI' by Alison Box ☆12 · Jun 10, 2022 · Updated 3 years ago
- duckdb-etl-framework ☆15 · Dec 20, 2024 · Updated last year
- a simple lakeFS webhook for pre-commit and pre-merge validation of data objects ☆13 · Nov 9, 2023 · Updated 2 years ago
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d… ☆12 · Aug 15, 2021 · Updated 4 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- ☆18 · Aug 15, 2022 · Updated 3 years ago
- ☆17 · Nov 26, 2024 · Updated last year
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆27 · Mar 17, 2026 · Updated 2 months ago
- Demo for GitHub Universe 2022 ☆13 · Jan 31, 2023 · Updated 3 years ago
- Example code that launches a docker container on AWS Fargate from AWS Lambda ☆18 · Dec 24, 2017 · Updated 8 years ago
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc… ☆12 · Aug 20, 2024 · Updated last year
- ☆13 · Feb 18, 2022 · Updated 4 years ago
- Social Media Analysis, scalable solution, flexible deployment that analyses social media contents ☆10 · Jul 20, 2023 · Updated 2 years ago
- ☆15 · Apr 4, 2021 · Updated 5 years ago
- Resilient Automation Functions and Scripts ☆15 · Jan 5, 2022 · Updated 4 years ago
- Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d… ☆12 · Oct 5, 2024 · Updated last year
- ☆18 · Apr 10, 2025 · Updated last year
- Building pipeline to process the real-time data using Spark and Mongodb. ☆12 · Oct 30, 2019 · Updated 6 years ago
- ☆17 · Mar 7, 2021 · Updated 5 years ago
- Matatika Community Edition ☆28 · Apr 23, 2026 · Updated last month
- A code sample that allows you to send a payload from the Twitter API to Google Sheets. ☆18 · Mar 23, 2021 · Updated 5 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow ☆20 · Dec 29, 2025 · Updated 5 months ago
- A dotnet standard wrapper for the Uniswap V2 Subgraph on The Graph GraphQL API. ☆12 · Dec 17, 2020 · Updated 5 years ago
- ☆17 · May 16, 2020 · Updated 6 years ago
- basic implementation of a bunch of optimization algorithms ☆13 · Jul 7, 2017 · Updated 8 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as … ☆17 · Oct 1, 2019 · Updated 6 years ago
- ☆23 · Nov 17, 2019 · Updated 6 years ago
- ☆21 · Mar 24, 2016 · Updated 10 years ago
- A tutorial on building a real-time data streaming application pipeline with Apache Kafka🔥🔥🔥 ☆24 · Apr 29, 2022 · Updated 4 years ago
- Java client for EventStore (http://geteventstore.com) ☆20 · May 25, 2015 · Updated 11 years ago
- Boto S3 Router provides a Boto3-like client that routes requests between S3 clients according to the bucket and the key in the request. ☆19 · Mar 3, 2022 · Updated 4 years ago
- Local AWS EMR - A local service that imitates AWS EMR ☆27 · Jul 5, 2023 · Updated 2 years ago
- A collection of CLI LLM tools that I built and use daily ☆15 · Aug 7, 2024 · Updated last year
- Systematic dataset of Brazilian sub-national Covid-19 policy and survey data ☆16 · Jun 22, 2023 · Updated 2 years ago
- Where the Meltano team runs Meltano! Get it??? ☆31 · Apr 9, 2025 · Updated last year
- dbt / Amazon Redshift Demonstration Project ☆34 · Jan 3, 2023 · Updated 3 years ago