arezamoosavi / AcidOnSpark-ETLView external linksLinks
Delta-Lake, ETL, Spark, Airflow
☆48Oct 9, 2022Updated 3 years ago
Alternatives and similar repositories for AcidOnSpark-ETL
Users that are interested in AcidOnSpark-ETL are comparing it to the libraries listed below
Sorting:
- ☆14Oct 10, 2025Updated 4 months ago
- ☆21Feb 5, 2024Updated 2 years ago
- ☆21Dec 11, 2021Updated 4 years ago
- ☆270Oct 23, 2024Updated last year
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 5 years ago
- A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lak…☆34Apr 17, 2024Updated last year
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated last week
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆14Sep 30, 2024Updated last year
- COMS 4111 Project 1☆12Jul 21, 2022Updated 3 years ago
- ☆15Apr 1, 2025Updated 10 months ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated last year
- ☆17Jun 8, 2025Updated 8 months ago
- A small guide to make recruiting a little easier.☆13Apr 3, 2023Updated 2 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated 10 months ago
- ☆10Jul 21, 2022Updated 3 years ago
- Google Cloud Platform solution that provides an event driven process that flattens (unnests) Google Analytics 360 data that has been expo…☆16Sep 9, 2021Updated 4 years ago
- ☆41Jul 4, 2022Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Toy Hadoop cluster combining various SQL-on-Hadoop variants☆13Nov 16, 2017Updated 8 years ago
- Enhancing Explainability in Fake News Detection uses SHAP and BiLSTM models to improve the transparency and interpretability of detecting…☆11Oct 11, 2024Updated last year
- ☆15Jan 26, 2026Updated 2 weeks ago
- Mathematics + Statistics Courses at the University of Alberta☆15Jan 8, 2023Updated 3 years ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 5 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated last year
- treelite runtime binding in Rust☆12Jun 12, 2025Updated 8 months ago
- A simple Python/Django CCTV system for IP cameras☆14Jan 15, 2017Updated 9 years ago
- Converts a database model designed in Vertabelo (http://vertabelo.com) to Flask-SQLAlchemy (https://pythonhosted.org/Flask-SQLAlchemy/) m…☆11Oct 27, 2025Updated 3 months ago
- A collection of small dash apps which I created for learning purposes. Some of them answer questions asked on the plotly forum. https://c…☆11Feb 8, 2024Updated 2 years ago
- ☆12Jan 27, 2022Updated 4 years ago
- This project ties together Flask, Dash, Docker and Nginx for bootstraping CI\CD pipelines of Flask \ Dash \ Plot.ly Applications☆12Jan 8, 2026Updated last month
- Build a data pipeline with Apache Airflow☆11May 7, 2021Updated 4 years ago
- ☆12Jun 14, 2024Updated last year
- Implement D*Lite and A* Algorithm on Processing environment☆11Apr 7, 2017Updated 8 years ago
- The elegance of Airflow + the power of AWS☆52Feb 5, 2024Updated 2 years ago
- ☆11Mar 15, 2017Updated 8 years ago
- Machine learning and statistical test to evaluate whether a pricing test running on the site has been successful☆11Jul 17, 2017Updated 8 years ago
- Language Translator using the new GPT-4o model☆17May 15, 2024Updated last year
- Deploying a Machine Learning model streaming application with Apache Kafka☆11Aug 21, 2022Updated 3 years ago
- ☆10Jul 2, 2017Updated 8 years ago