hadoop各组件使用,持续更新
☆900Jan 4, 2023Updated 3 years ago
Alternatives and similar repositories for cdhproject
Users that are interested in cdhproject are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于CDH5.x parcles安装,一键卸载脚本☆39Sep 28, 2022Updated 3 years ago
- flink learning blog. http://www.54tianzhisheng.cn☆25Sep 26, 2019Updated 6 years ago
- HBase快照增量导出☆19Nov 2, 2017Updated 8 years ago
- A data integration framework☆4,108Dec 2, 2025Updated 4 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,325Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Livy and Zeppelin services for Cloudera Manager and CDH using CSDs and Parcels☆22Aug 16, 2018Updated 7 years ago
- flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Ta…☆15,052Apr 14, 2026Updated 2 weeks ago
- 基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法☆2,056Feb 21, 2024Updated 2 years ago
- 定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo …☆922Mar 9, 2026Updated last month
- 酷玩 Spark: Spark 源代码解析、Spark 类库等☆3,479May 18, 2022Updated 3 years ago
- Flink 中文视频课程(持续更新...)☆4,624Jun 18, 2020Updated 5 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的 即席查询服务)☆380Dec 16, 2023Updated 2 years ago
- hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)☆378Aug 14, 2023Updated 2 years ago
- CDH安装手册☆86Apr 23, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources☆2,045Oct 25, 2022Updated 3 years ago
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆167Oct 18, 2021Updated 4 years ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code☆14,244Apr 24, 2026Updated last week
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.☆9,280Apr 24, 2026Updated last week
- Cloudera Manager API Client☆310Dec 17, 2023Updated 2 years ago
- DBus☆1,214Dec 6, 2022Updated 3 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆242Jan 2, 2023Updated 3 years ago
- Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications…☆3,406Apr 15, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Apache Spark 官方文档中文版☆1,180Jul 21, 2023Updated 2 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,840May 29, 2024Updated last year
- CDH5.16.2 离线安装脚本☆17Jan 19, 2021Updated 5 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆194Dec 16, 2023Updated 2 years ago
- DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、…☆5,993Jun 2, 2024Updated last year
- A AI-Driven, Distributed and high-performance monitoring system, for comprehensive monitoring and management of kafka cluster.☆3,177Dec 18, 2025Updated 4 months ago
- DataX是阿里云DataWorks数据集成的开源版本。☆17,190Jul 1, 2025Updated 10 months ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆813Dec 11, 2024Updated last year
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Apr 15, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Stream computing platform for bigdata☆408Apr 24, 2024Updated 2 years ago
- Spark-2.3.1源码解读☆199Dec 5, 2022Updated 3 years ago
- DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitizati…☆3,257Nov 4, 2025Updated 5 months ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆153Apr 21, 2023Updated 3 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- 基于 Flink 的 sqlSubmit 程序☆145Mar 6, 2024Updated 2 years ago
- ☆393Jan 25, 2024Updated 2 years ago