databooks / databook
Databooks, the book of data.
☆22Updated 2 years ago
Alternatives and similar repositories for databook:
Users who are interested in databook are comparing it to the libraries listed below.
- ☆47Updated last year
- Unified SQL Analytics Engine Based on SparkSQL☆210Updated 2 years ago
- Data governance and data quality checking/monitoring platform (Django + jQuery + MySQL)☆184Updated 2 years ago
- Visual editing and management of Airflow DAGs☆47Updated 2 years ago
- ☆25Updated last year
- example☆66Updated 4 years ago
- ☆28Updated 3 years ago
- Deploy a big data platform using Docker Compose. Big data components include Hadoop, Hive, HBase, Presto, Flink, ES, Kafka, etc.☆138Updated 5 months ago
- A sample of Flink TiDB Realtime Datawarehouse.☆84Updated 3 years ago
- Real-time ETL developed with Flink, moving data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into Kafka, use Flink to cons…☆78Updated 11 months ago
- Data governance -> data quality☆10Updated 5 years ago
- Demo documenting API changes across HBase versions☆33Updated 5 years ago
- Flink SQL Management☆8Updated 4 years ago
- Reactive governance platform for massive data☆39Updated 4 years ago
- ☆56Updated 2 years ago
- MaxCompute spark demo for building a runnable application.☆105Updated last month
- DataSphereStudio documents.☆114Updated 2 months ago
- ☆42Updated 5 years ago
- ☆14Updated 2 years ago
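- The presto hbase connector component is implemented according to the Presto Connector interface specification and adds HBase query support to Presto. Compared with other open-source HBase connectors, our performance is more than 10 to 100 times faster.☆241Updated 2 years ago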
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- If you work in big data BI and want to compare how different implementations such as MySQL, GreenPlum, Elasticsearch, Hive, Spark SQL, Presto, Impala, Drill, HAWQ, Druid, Pinot, Kylin, ClickHouse, and Kudu perform,…☆279Updated 6 years ago
- Data quality check tool that works by executing SQL☆21Updated 7 years ago
- Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and wri…☆74Updated 6 months ago
- [Translation] Airflow documentation in Chinese☆213Updated last year
- Spark scaffolding project that standardizes the Spark development, deployment, and testing workflow.☆93Updated 4 months ago
- Pack clouds and engines into one light stack☆34Updated last year
- ☆16Updated 2 years ago
- πflow is a big data flow engine with spark support☆526Updated 3 months ago
- seatunnel plugin developing examples.☆35Updated 3 years ago