loicmathieu / docker-cdh
Here is my Git repo for my Docker files related to Cloudera Hadoop CDH. To get started, it is best to check the documentation at https://github.com/loicmathieu/docker-cdh/tree/master/cloudera-cdh-edgenode
☆56Updated 6 years ago
Alternatives and similar repositories for docker-cdh
Users that are interested in docker-cdh are comparing it to the libraries listed below
- Apache Flink docker image☆195Updated 3 years ago
- A sample of Flink TiDB Realtime Datawarehouse.☆85Updated 4 years ago
- This repository tracks the code and files for building a Docker image with Apache Kylin.☆126Updated 3 years ago
- ☆28Updated 3 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Updated 2 years ago
- A web application for submitting Spark applications☆8Updated 4 years ago
- Docker image with Ambari☆291Updated 7 years ago
- ☆14Updated 3 years ago
- Facebook Presto connectors☆49Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last week
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 7 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- CDH installation manual☆86Updated 2 years ago
- Learning Apache Kylin for beginners☆29Updated 7 years ago
- Guardian of Waterdrop and Spark☆30Updated 2 years ago
- Container scheduling engine based on YARN☆36Updated 9 years ago
- Flink SQL tutorial☆34Updated 7 months ago
- Example of using greenplum-spark connector☆19Updated 6 years ago
- ☆30Updated 2 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Updated last year
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆107Updated 2 months ago
- Open-source distributed workflow scheduling tool that also supports streaming tasks.☆39Updated 7 years ago
- Docker packaging for Apache Flink☆139Updated 5 years ago
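- DataTunnel is an ultra-high-performance distributed data integration tool based on the Spark engine that supports synchronizing massive volumes of data. Its DSL syntax, built on Spark extensions and combined with Spark SQL, fits more easily into the data warehouse ETLT process and is simple to use.☆29Updated last week
- Read and write Solr using Hive☆31Updated 3 years ago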
- Hadoop (Hadoop, Hive, Hue, HBase) deployer☆111Updated 11 years ago
- Flink Iceberg integration tests, with jobs running on YARN.☆38Updated 4 years ago
- Make data connection easier☆22Updated 3 years ago
- ☆48Updated last year
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago