mikeroyal / Apache-Airflow-Guide
Apache Airflow Guide
β28Updated 10 months ago
Alternatives and similar repositories for Apache-Airflow-Guide:
Users that are interested in Apache-Airflow-Guide are comparing it to the libraries listed below
- SQL/NoSQL DB Guide. Learn about SQL/NoSQL databases & Distributed Systems.β60Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data πβ32Updated 2 years ago
- A curated list of awesome open source tools and commercial products that will help you manage machine learning and data-science workflowsβ¦β23Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- Awesome list of dataops products, open source and resourcesβ24Updated 2 years ago
- A tool to automatically infer columns data types in .csv filesβ35Updated 2 years ago
- This repository contains code to build an MVP search engine with google like interface.β15Updated 4 years ago
- duckdb-etl-frameworkβ10Updated 3 months ago
- β17Updated 7 months ago
- TensorFlow Guideβ16Updated 3 years ago
- A curated list of awesome Databricks resources, including Sparkβ17Updated 8 months ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/β11Updated 10 months ago
- Some example projects for Data Engineers to build, end-to-end.β28Updated last year
- Full stack data engineering tools and infrastructure set-upβ50Updated 4 years ago
- Challenge Data Engineerβ25Updated 2 years ago
- Repo for CDC with debezium blog postβ28Updated 6 months ago
- Awesome list for datapipelineβ34Updated 2 years ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysiβ¦β44Updated 2 years ago
- Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Servicesβ25Updated 2 weeks ago
- Delta Lake Documentationβ49Updated 9 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Supersetβ39Updated 4 months ago
- Apache Spark Guideβ31Updated 3 years ago
- Sample fastAPI Application to demonstrate OpenTelemetry instrumentationβ14Updated 10 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β57Updated 3 months ago
- Code for my "Efficient Data Processing in SQL" book.β56Updated 7 months ago
- β10Updated 2 years ago
- β12Updated last year
- Template for Data Engineering and Data Pipeline projectsβ108Updated 2 years ago
- Apache Kafka Guideβ31Updated 3 years ago
- demo examples how to load data from different sources to different destinationsβ20Updated last month