dantezhao/data-group

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dantezhao/data-group)

dantezhao / data-group

☆76

Alternatives and similar repositories for data-group

Users that are interested in data-group are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dantezhao / paper-notes
View on GitHub
论文阅读总结
☆32Jun 13, 2019Updated 7 years ago
dantezhao / data-warehouse
View on GitHub
The book of data warehouse
☆196Oct 13, 2022Updated 3 years ago
passionke / starry
View on GitHub
fast spark local mode
☆35Aug 20, 2018Updated 7 years ago
allwefantasy / mlsql-web-console
View on GitHub
☆18Jun 16, 2021Updated 5 years ago
direct-spark-sql / direct-spark-sql
View on GitHub
a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.
☆13Jun 13, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
cjuexuan / mynote
View on GitHub
☆233Sep 15, 2022Updated 3 years ago
cpbaranwal / Spark-Streaming-DirectKafka-Examples
View on GitHub
DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management
☆59Sep 9, 2016Updated 9 years ago
xinghalo / Teddy
View on GitHub
Spark Streaming监控平台，支持任务部署与告警、自启动
☆129Mar 29, 2018Updated 8 years ago
byzer-org / byzer-lang
View on GitHub
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
☆1,835May 29, 2024Updated 2 years ago
Skycrab / model-deploy
View on GitHub
Spark PMML 模型离线部署
☆13Dec 14, 2022Updated 3 years ago
bebee4java / ides
View on GitHub
智能数据探索服务(Intelligent Data Exploration Service)，一站式Data + AI数据解决方案！
☆36Jul 10, 2023Updated 3 years ago
bingosummer / techcontent
View on GitHub
☆10Nov 30, 2016Updated 9 years ago
dafei1288 / jimsql
View on GitHub
JimSql = Jim Isn't MySQL. Jim is a filesystem database system implemention use Java.
☆15Dec 15, 2025Updated 7 months ago
yaooqinn / spark-authorizer
View on GitHub
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆183Apr 6, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
flink-china / 1.6.0
View on GitHub
Flink 1.6.0 文档地址
☆38Nov 11, 2018Updated 7 years ago
gudaoxuri / ez-fs
View on GitHub
实现Local、FTP、HDFS文件系统的统一操作。
☆13May 12, 2016Updated 10 years ago
jacksu / utils4s
View on GitHub
scala、spark使用过程中，各种测试用例以及相关资料整理
☆1,082Feb 9, 2019Updated 7 years ago
running-elephant / moonbox
View on GitHub
Moonbox is a DVtaaS (Data Virtualization as a Service) Platform
☆505Apr 14, 2023Updated 3 years ago
teeyog / IQL
View on GitHub
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
☆377Dec 16, 2023Updated 2 years ago
leno1001 / spark_monitor
View on GitHub
请求spark rest API获取applications，jobs，stages，executors，rdds，streaming，environment等信息提供监控和报警服务
☆11Nov 22, 2018Updated 7 years ago
xuwei517 / FlinkProj
View on GitHub
Flink 案例代码
☆43Apr 24, 2026Updated 3 months ago
coolplaydata / coolplayflink
View on GitHub
Flink: Stateful Computations over Data Streams
☆15Aug 20, 2018Updated 7 years ago
dtinfor / dqcenter
View on GitHub
基于PowerCenter的数据质量监控系统
☆13Dec 27, 2017Updated 8 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
luangm / learn4ts
View on GitHub
Machine Learning written in TypeScript (to replace learn4js)
☆10Apr 11, 2018Updated 8 years ago
wanghan0501 / UserSessionBehaviorOfflineAnalysis
View on GitHub
四川大学拓思爱诺用户session行为数据离线分析项目
☆68Jul 1, 2022Updated 4 years ago
onfocusio / kafka-connect-kudu
View on GitHub
kafka-connect-kudu is a Kafka Connector for loading data to Apache Kudu
☆11Jun 2, 2017Updated 9 years ago
opendingtalk / eapp-isv-project
View on GitHub
E应用开发-ISV应用解决方案
☆10Aug 17, 2018Updated 7 years ago
neoremind / biz-framework
View on GitHub
针对复杂业务逻辑的Java实现系统，抽象出一套编程框架，借鉴领域模型的设计方法，使得开发体验更加环保、更加友好，大大提高代码的后期可维护性
☆24Aug 3, 2014Updated 11 years ago
scxwhite / hera
View on GitHub
hera 分布式任务调度系统大数据任务调度系统任务调度（数据部门专用）
☆378Aug 14, 2023Updated 2 years ago
whitestarlau / PracticeRustWebServer
View on GitHub
☆10Nov 12, 2023Updated 2 years ago
hammerlab / spark-json-relay
View on GitHub
SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.
☆16Apr 6, 2021Updated 5 years ago
xpleaf / minidubbo
View on GitHub
A Full RPC Framework Based on Netty.
☆14May 19, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
IRoye / sqlEditor
View on GitHub
一个SQL 编辑器的前端界面
☆18Nov 28, 2020Updated 5 years ago
ziaochina / mk-meta-engine
View on GitHub
元数据化引擎，在mk-app-loader实现的应用隔离基础上，实现可以用json元数据描述界面模型
☆12Jul 23, 2018Updated 8 years ago
qindongliang / streaming-offset-to-zk
View on GitHub
一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目
☆133Dec 17, 2025Updated 7 months ago
allwefantasy / mammuthus-yarn-docker-scheduler
View on GitHub
基于Yarn的容器调度引擎(container scheduler based on yarn)
☆36Apr 5, 2016Updated 10 years ago
bebee4java / spark-notes
View on GitHub
spark学习中文笔记
☆12Mar 26, 2019Updated 7 years ago
cas-packone / ambari-kylin-service
View on GitHub
☆30Dec 24, 2022Updated 3 years ago
zaratsian / SparkPhoenix
View on GitHub
Spark Example using Phoenix to interact with HBase
☆16Nov 2, 2016Updated 9 years ago