This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon S3, AWS Glue and Delta Lake.
☆18Aug 25, 2021Updated 4 years ago
Alternatives and similar repositories for aws-glue-delta-lake
Users that are interested in aws-glue-delta-lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Nov 26, 2024Updated last year
- ☆10May 5, 2022Updated 4 years ago
- A better SmartyStreets/LiveAddress API library for Python☆12Jan 2, 2025Updated last year
- duckdb-etl-framework☆15Dec 20, 2024Updated last year
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d…☆12Aug 15, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Aug 15, 2022Updated 3 years ago
- ☆17Nov 26, 2024Updated last year
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Simplified job application tracker using Notion API powered by TypeScript and Selenium.☆12Aug 28, 2023Updated 2 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆27Mar 17, 2026Updated last month
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- 2020☆14Dec 8, 2022Updated 3 years ago
- Demo for GitHub Universe 2022☆13Jan 31, 2023Updated 3 years ago
- NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)☆16May 20, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…☆12Aug 20, 2024Updated last year
- ☆13Feb 18, 2022Updated 4 years ago
- Change Data Capture (CDC) from PostgreSQL to ClickHouse☆16Jul 15, 2024Updated last year
- ☆16Jan 20, 2019Updated 7 years ago
- code for writing twitter bots in several languages☆13Dec 31, 2015Updated 10 years ago
- Social Media Analysis, scalable solution, flexible deployment that analyses social media contents☆10Jul 20, 2023Updated 2 years ago
- Resilient Automation Functions and Scripts☆15Jan 5, 2022Updated 4 years ago
- Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d…☆12Oct 5, 2024Updated last year
- ☆18Apr 10, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Building pipeline to process the real-time data using Spark and Mongodb.☆12Oct 30, 2019Updated 6 years ago
- A Serverless project to help you operate on every existing item in a DynamoDB table☆17Mar 5, 2019Updated 7 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy)☆13Aug 24, 2022Updated 3 years ago
- Matatika Community Edition☆28Apr 23, 2026Updated 2 weeks ago
- A code sample that allows you to send a payload from the Twitter API to Google Sheets.☆18Mar 23, 2021Updated 5 years ago
- ☆11Nov 25, 2020Updated 5 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 4 months ago
- ☆17May 16, 2020Updated 5 years ago
- ☆12Oct 6, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆23Nov 17, 2019Updated 6 years ago
- ☆20Aug 10, 2021Updated 4 years ago
- A tutorial on building a real-time data streaming application pipeline with Apache Kafka🔥🔥🔥☆24Apr 29, 2022Updated 4 years ago
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 10 years ago
- Boto S3 Router provides a Boto3-like client that routes requests between S3 clients according to the bucket and the key in the request.☆18Mar 3, 2022Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year