klescosia/aws-glue-delta-lake

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/klescosia/aws-glue-delta-lake)

klescosia / aws-glue-delta-lake

This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon S3, AWS Glue and Delta Lake.

☆18

Alternatives and similar repositories for aws-glue-delta-lake

Users that are interested in aws-glue-delta-lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / aws-glue-job-status-email-report
View on GitHub
☆18Aug 15, 2022Updated 3 years ago
Apress / up-and-running-w-dax-for-power-bi
View on GitHub
Source code for 'Up and Running with DAX for Power BI' by Alison Box
☆12Jun 10, 2022Updated 4 years ago
IvanWoo / trino-on-kubernetes
View on GitHub
☆10May 5, 2022Updated 4 years ago
lmassaoy / docker-rastreio-correios
View on GitHub
(PT-BR) O objetivo deste projeto é executar uma aplicação Python dentro de um container Docker (ou container Kubernetes utilizando a dock…
☆12Jun 23, 2020Updated 6 years ago
soumilshah1995 / duckdb-etl-framework
View on GitHub
duckdb-etl-framework
☆14Dec 20, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
treeverse / lakeFS-hooks
View on GitHub
a simple lakeFS webhook for pre-commit and pre-merge validation of data objects
☆13Nov 9, 2023Updated 2 years ago
alexguimenti / whatsappAudioSpeedChanger
View on GitHub
🔊 A Google Chrome Extension to change audio speed on Web Whatsapp.
☆10Jun 28, 2020Updated 6 years ago
siddharth271101 / Covid-19-and-Aviation-Industry
View on GitHub
The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…
☆13Jun 26, 2022Updated 4 years ago
jeromevdl / boto3-lambda-layer
View on GitHub
Shell script that creates an AWS Lambda Layer with the specified (or latest) version of boto3
☆11May 29, 2019Updated 7 years ago
wesleyosantos91 / poc-micronaut-kotlin-grpc
View on GitHub
Prova de conceito - Micronaut, Kotlin e GRPC
☆12Mar 16, 2021Updated 5 years ago
e6data / awesome-optimizing-iceberg-tables
View on GitHub
☆17Nov 26, 2024Updated last year
RealKinetic / aws-glue-pipeline-example
View on GitHub
An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.
☆13Oct 15, 2020Updated 5 years ago
aws-samples / data-purging-aws-data-lake
View on GitHub
☆22Jul 14, 2020Updated 6 years ago
aws-samples / aws-amplify-social-network-app-workshop
View on GitHub
☆16Mar 18, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ranjbaryshahab / postgres-cdc-clickhouse
View on GitHub
Change Data Capture (CDC) from PostgreSQL to ClickHouse
☆16Jul 15, 2024Updated 2 years ago
Renien / docker-spark-livy
View on GitHub
Spark Standalone & Livy
☆11Jul 13, 2021Updated 5 years ago
method5 / method4
View on GitHub
Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d…
☆12Oct 5, 2024Updated last year
lmassaoy / spark-on-k8s
View on GitHub
Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…
☆11Nov 25, 2020Updated 5 years ago
dacort / ci-cd-serverless-spark
View on GitHub
Demo for GitHub Universe 2022
☆13Jan 31, 2023Updated 3 years ago
ljaviertovar / jsonserver-api-rest
View on GitHub
API REST developed with JSON Server
☆16Sep 8, 2022Updated 3 years ago
xfold / LanguageBiasesInReddit
View on GitHub
Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…
☆12Aug 20, 2024Updated last year
sp6370 / Job-Application-Tracker
View on GitHub
Simplified job application tracker using Notion API powered by TypeScript and Selenium.
☆12Aug 28, 2023Updated 2 years ago
wesleyosantos91 / poc-springboot-kafka
View on GitHub
Prova de conceito - Springboot, Java, Schema Registry, Apache Avro e Apache Kafka .
☆14Apr 18, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Akrog / gcs-client
View on GitHub
Google Cloud Storage Python Client
☆14Dec 26, 2022Updated 3 years ago
manishkr1754 / NIFTY50_Data_Analysis_NSETOOLS_NSEPY_Python
View on GitHub
NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)
☆17May 20, 2023Updated 3 years ago
romulovieira777 / Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark
View on GitHub
☆13Feb 18, 2022Updated 4 years ago
enthought / Numpy-Tutorial-SciPyConf-2022
View on GitHub
Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy)
☆13Aug 24, 2022Updated 3 years ago
semashkinvg / DataVault
View on GitHub
☆16Jan 20, 2019Updated 7 years ago
joelgrus / polyglot-twitter-bot
View on GitHub
code for writing twitter bots in several languages
☆13Dec 31, 2015Updated 10 years ago
Pahulpreet86 / Real-Time-Data-Pipeline-Using-Kafka-and-Spark
View on GitHub
☆16Feb 17, 2020Updated 6 years ago
koksang / social-media-analysis
View on GitHub
Social Media Analysis, scalable solution, flexible deployment that analyses social media contents
☆10Jul 20, 2023Updated 3 years ago
rpkilby / SurveyGizmo
View on GitHub
Wrapper for SurveyGizmo's restful API service
☆16Sep 24, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ecloudvalley / Building-a-Data-Lake-with-AWS-Glue-and-Amazon-S3
View on GitHub
☆18Nov 16, 2018Updated 7 years ago
alexdebrie / serverless-dynamodb-scanner
View on GitHub
A Serverless project to help you operate on every existing item in a DynamoDB table
☆17Mar 5, 2019Updated 7 years ago
rajrohan / spark-streaming-twitter
View on GitHub
Building pipeline to process the real-time data using Spark and Mongodb.
☆12Oct 30, 2019Updated 6 years ago
antoniopapa / go-ambassador
View on GitHub
☆20Oct 12, 2021Updated 4 years ago
ptlis / psr7-conneg
View on GitHub
PSR-7 Content negotiation
☆14Jun 16, 2015Updated 11 years ago
isaaclucky / data-warehousing
View on GitHub
Data warehouse tech stack with PostgreSQL, DBT and Airflow
☆20Dec 29, 2025Updated 7 months ago
LaravelDaily / Flutter-Public-API-Demo
View on GitHub
☆12Nov 25, 2020Updated 5 years ago