☆77Oct 22, 2018Updated 7 years ago
Alternatives and similar repositories for data-group
Users that are interested in data-group are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 论文阅读总结☆32Jun 13, 2019Updated 6 years ago
- The book of data warehouse☆196Oct 13, 2022Updated 3 years ago
- fast spark local mode☆35Aug 20, 2018Updated 7 years ago
- ☆19Jun 16, 2021Updated 4 years ago
- a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.☆13Jun 13, 2023Updated 2 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆129Mar 29, 2018Updated 7 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,842May 29, 2024Updated last year
- 智能数据探索服务(Intelligent Data Exploration Service),一站式Data + AI数据解决方案!☆35Jul 10, 2023Updated 2 years ago
- JimSql = Jim Isn't MySQL. Jim is a filesystem database system implemention use Java.☆15Dec 15, 2025Updated 3 months ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆182Apr 6, 2022Updated 3 years ago
- Flink 1.6.0 文档地址☆38Nov 11, 2018Updated 7 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Apr 12, 2022Updated 3 years ago
- 实现Local、FTP、HDFS文件系统的统一操作。☆13May 12, 2016Updated 9 years ago
- httpfs java client, read & write hdfs filesystem with the webhdfs REST HTTP API☆22Jun 7, 2014Updated 11 years ago
- scala、spark使用过程中,各种测试用例以及相关资料整理☆1,087Feb 9, 2019Updated 7 years ago
- Flink 案例代码☆43Jun 17, 2022Updated 3 years ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆506Apr 14, 2023Updated 2 years ago
- ☆13Jul 12, 2016Updated 9 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆380Dec 16, 2023Updated 2 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- 基于PowerCenter的数据质量监控系统☆13Dec 27, 2017Updated 8 years ago
- Flink: Stateful Computations over Data Streams☆15Aug 20, 2018Updated 7 years ago
- ecstore的官方docker镜像☆18May 12, 2019Updated 6 years ago
- Machine Learning written in TypeScript (to replace learn4js)☆10Apr 11, 2018Updated 7 years ago
- 四川大学拓思爱诺用户session行为数据离线分析项目☆68Jul 1, 2022Updated 3 years ago
- A small library of hive UDFS using Macros to process and manipulate complex types☆15Oct 2, 2025Updated 5 months ago
- ☆18May 21, 2019Updated 6 years ago
- MCP = Multiple source Convert Platform☆11Aug 2, 2022Updated 3 years ago
- E应用开发-ISV应用解决方案☆10Aug 17, 2018Updated 7 years ago
- 针对复杂业务逻辑的Java实现系统,抽象出一套编程框架,借鉴领域模型的设计方法,使得开发体验更加环保、更加友好,大大提高代码的后期可维护性☆24Aug 3, 2014Updated 11 years ago
- ☆10Sep 17, 2017Updated 8 years ago
- A Full RPC Framework Based on Netty.☆14May 19, 2018Updated 7 years ago
- hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)☆377Aug 14, 2023Updated 2 years ago
- ☆10Nov 12, 2023Updated 2 years ago
- 基于Yarn的容器调度引擎(container scheduler based on yarn)☆36Apr 5, 2016Updated 9 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated 3 months ago
- 一个SQL 编辑器的前端界面☆18Nov 28, 2020Updated 5 years ago
- ☆30Dec 24, 2022Updated 3 years ago