harrytandata / airflow_course
☆172Updated 2 years ago
Alternatives and similar repositories for airflow_course:
Users that are interested in airflow_course are comparing it to the libraries listed below
- Provide docker environment and examples for PyFlink☆192Updated last year
- apache-airflow 系列中文资料 👏 `Star`☆48Updated 3 years ago
- [译] Airflow 中文文档☆212Updated last year
- 基于 PyFlink 的学习文档,通过一个个小实践,便于大家快速入手 PyFlink☆272Updated 3 years ago
- ☆29Updated 5 years ago
- python ETL framework☆100Updated 3 years ago
- example☆66Updated 4 years ago
- Airflow Dag可视化编辑和管理☆47Updated 2 years ago
- PyFlink从入门到精通☆23Updated last year
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆53Updated last week
- A library developed to ease the data ETL development process.☆133Updated 2 months ago
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆22Updated last year
- SQL blood relationship analysis tool based on Python sqlparse☆27Updated 2 years ago
- A Hadoop cluster based on Docker, including Hive and Spark.☆77Updated 2 years ago
- ☆16Updated 2 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆138Updated 4 months ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆162Updated 3 years ago
- The book of data warehouse☆195Updated 2 years ago
- smartnotebook docker-compose 部署脚本☆22Updated 2 months ago
- SuperBI 是达闼科技以开源项目superset为基础开发的企业级快速BI应用。 可扩展的框架设计,支持多种DBMS数据源,让数据BI更加简单。 superbi提供直观的UI,拖拽式的编辑体验,配置式的图例创建,轻松创建数据可视化dashboard的能力。☆47Updated 3 years ago
- ☆230Updated last year
- ☆47Updated last year
- 使用 python 操作大数据的各种组件☆62Updated last year
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆65Updated 3 weeks ago
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 2 years ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆12Updated 2 years ago
- Python client for Trino☆344Updated last week
- Notes talking about the design and implementation of Apache Spark☆19Updated 4 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆112Updated last year
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆232Updated 2 years ago