ysc / data-generator
如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,那你就需要一份标准的数据进行测试,这个开源项目就是为了生成这样的标准数据。
☆281Updated 6 years ago
Alternatives and similar repositories for data-generator:
Users that are interested in data-generator are comparing it to the libraries listed below
- mysql数据实时增量导入hive☆87Updated 7 years ago
- azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。☆119Updated 7 years ago
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆284Updated 2 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆210Updated 2 years ago
- 为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能☆144Updated last year
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆241Updated 2 years ago
- flink-sql 在 flink 上运行 sql 和 构建数据流的平台 基于 apache flink 1.10.0☆110Updated 2 years ago
- Spark、Flink等离线任务的调度以及实时任务的监控☆300Updated last year
- Platform for Flink☆281Updated 2 years ago
- A simple project used to submit a Flink SQL script☆370Updated 5 years ago
- Atlas官方文档中文版☆70Updated 5 years ago
- Stream computing platform for bigdata☆401Updated 11 months ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆128Updated 7 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆383Updated last year
- datax web。datax中的web配置界面没有集成在一起开源出来,此为web端配置项目。☆100Updated 6 years ago
- The Data Processer☆95Updated 8 years ago
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆162Updated 3 years ago
- hadoop,hbase,storm,spark,etc..☆160Updated 5 years ago
- Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus☆155Updated 8 years ago
- ☆148Updated 9 months ago
- Real time data processing system based on flink and CEP☆248Updated 3 months ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆78Updated last year
- 基于 Flink 的 sqlSubmit 程序☆145Updated last year
- A sample of Flink TiDB Realtime Datawarehouse.☆84Updated 3 years ago
- ☆137Updated 6 years ago
- Flink代码实例☆122Updated 4 years ago
- Flink 菜鸟公众号代码地址☆64Updated 4 months ago
- datax数据同步elasticsearch的reader和writer插件,支持一对多的扁平数据转换成es的嵌套对象,也支持嵌套对象的读取和ognl表达式过滤,理论上可以无限嵌套。☆88Updated last year
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。☆264Updated last year
- 数据血缘,Hive/Sqoop/HBase/Spark等,发送到kafka后,解析处理使用neo4j生成血缘☆81Updated 3 years ago