loicmathieu / docker-cdhLinks
Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.com/loicmathieu/docker-cdh/tree/master/cloudera-cdh-edgenode
☆56Updated 7 years ago
Alternatives and similar repositories for docker-cdh
Users that are interested in docker-cdh are comparing it to the libraries listed below
Sorting:
- Apache Flink docker image☆197Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 5 months ago
- Unified SQL Analytics Engine Based on SparkSQL☆212Updated 3 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 8 years ago
- ☆30Updated 3 years ago
- ☆251Updated 3 years ago
- This repository trackes the code and files for building docker image with Apache Kylin.☆127Updated 4 years ago
- CDH安装手册☆86Updated 2 years ago
- ☆29Updated 4 years ago
- A library based on delta for Spark and MLSQL☆61Updated 5 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 8 years ago
- Example of using greenplum-spark connector☆20Updated 6 years ago
- Ambari service for Apache Flink☆127Updated 4 years ago
- Java library to integrate Flink and Kudu☆55Updated 8 years ago
- ☆48Updated 2 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- Ambari service for Presto☆44Updated 11 months ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Updated last year
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆111Updated 8 months ago
- 反应式 海量数据治理平台☆39Updated 5 years ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 6 years ago
- ☆14Updated 3 years ago
- Java client for managing Apache Flink via REST API☆57Updated 4 months ago
- Docker packaging for Apache Flink☆139Updated 5 years ago
- ☆56Updated 3 years ago
- Airflow Dag可视化编辑和管理☆45Updated 3 years ago
- [Cloudframeworks]SMACK Big Data Architecture - user guide / [云框架]SMACK大数据架构-用户指南☆70Updated 8 years ago
- facebook presto connectors☆49Updated 4 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步 。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆35Updated last month