如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,那你就需要一份标准的数据进行测试,这个开源项目就是为了生成这样的标准数据。
☆285May 24, 2018Updated 7 years ago
Alternatives and similar repositories for data-generator
Users that are interested in data-generator are comparing it to the libraries listed below
Sorting:
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Jul 28, 2017Updated 8 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆242Jan 2, 2023Updated 3 years ago
- 基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法☆2,059Feb 21, 2024Updated 2 years ago
- A data integration framework☆4,110Dec 2, 2025Updated 2 months ago
- 数据库访问中间件,统一的标准sql查询,底层可以是不同的数据库包括mysql、ElasticSearch、kylin、presto等。☆14Apr 21, 2018Updated 7 years ago
- 适合2到6岁的宝宝打字游戏☆10May 29, 2020Updated 5 years ago
- DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。☆22Jan 31, 2018Updated 8 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- 学习 Spark 的一个小项目,以及其中各种调优的笔记☆177Jul 20, 2017Updated 8 years ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆64Dec 5, 2023Updated 2 years ago
- 基于ansible的Greenplum集群多主机节点一键安装工具//dbswitch.gitee.io/docs-site/☆15Jul 4, 2021Updated 4 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- 微服务日志之实时日志☆30Jul 6, 2018Updated 7 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆138Feb 23, 2022Updated 4 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆381Dec 16, 2023Updated 2 years ago
- log、event 、time 、window 、table、sql、connect、join、async IO、维表、CEP☆69Sep 8, 2022Updated 3 years ago
- 基于TBSchedule开发的一个分布式任务调度框架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago
- spark to yandex clickhouse connector☆69Sep 4, 2019Updated 6 years ago
- A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources☆2,055Oct 25, 2022Updated 3 years ago
- datax web。datax中的web配置界面没有集成在一起开源出来,此为web端配置项目。☆99Mar 19, 2019Updated 6 years ago
- 使用flink快速构建实时监控系统报警☆19Sep 7, 2019Updated 6 years ago
- kudu可视化工具☆38Jul 12, 2025Updated 7 months ago
- 杭州第六次 Spark & Flink Meetup☆30May 14, 2018Updated 7 years ago
- 分布式任务调度框架教程, 包括: Quartz、Elastic-Job和TBSchedule.☆32Mar 4, 2019Updated 6 years ago
- 使用spring-boot-spark的一个样例☆11Aug 3, 2018Updated 7 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,845May 29, 2024Updated last year
- java爬虫,反爬虫策略、ETL清洗数据,以及spark离线和实时分析新闻并存入ES☆19Nov 26, 2018Updated 7 years ago
- 基于Spring Boot 2.x的前后端分离架构JBoot 前台:Vue+iView 后台:Spring Boot 2.x/Spring Security/JWT/Spring Data JPA+Mybatis-Plus/Redis/Elasticsearch 分布式限…☆23Jan 28, 2019Updated 7 years ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆977Nov 16, 2022Updated 3 years ago
- presto's elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
- 基于spring boot + quartz + redis实现job任务调度,前端使用vue和element-ui实现页面控制台。☆13Jan 30, 2019Updated 7 years ago
- Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications…☆3,416Feb 12, 2026Updated 2 weeks ago
- Apache Spark structured streaming connector for Yandex ClickHouse OLAP☆16Aug 10, 2017Updated 8 years ago
- DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、…☆5,984Jun 2, 2024Updated last year
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆290Dec 16, 2022Updated 3 years ago
- A simple project used to submit a Flink SQL script☆376Sep 2, 2019Updated 6 years ago
- DBus☆1,214Dec 6, 2022Updated 3 years ago
- AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase☆109Jun 21, 2022Updated 3 years ago
- 基于flink的实时流计算web平台☆1,869Dec 2, 2025Updated 2 months ago