loicmathieu / docker-cdh
Here is my git repo for my Docker files related to Cloudera Hadoop CDH. To start, it is best to check the documentation at https://github.com/loicmathieu/docker-cdh/tree/master/cloudera-cdh-edgenode
☆56Updated 7 years ago
Alternatives and similar repositories for docker-cdh
Users that are interested in docker-cdh are comparing it to the libraries listed below
- Apache Flink docker image☆195Updated 3 years ago
- This repository tracks the code and files for building a Docker image with Apache Kylin.☆126Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 3 months ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 6 years ago
- ☆14Updated 3 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆212Updated 2 years ago
- Docker image with Ambari☆291Updated 7 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆109Updated 5 months ago
- Example of using greenplum-spark connector☆20Updated 6 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- Ambari service for Presto☆44Updated 8 months ago
- ☆30Updated 2 years ago
- ☆48Updated 2 years ago
- Enable Impala on HDP☆52Updated 6 years ago
- ☆43Updated 6 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Updated last year
- Visual editing and management of Airflow DAGs☆46Updated 2 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- A sample of Flink TiDB Realtime Datawarehouse.☆84Updated 4 years ago
- CDH installation manual☆86Updated 2 years ago
- ☆56Updated 2 years ago
- The sixth Hangzhou Spark & Flink Meetup☆30Updated 7 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 8 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- ☆28Updated 3 years ago
- Make data connection easier☆22Updated 3 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 3 weeks ago
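- The presto hbase connector component is implemented against the Presto Connector interface specification and adds the ability to query HBase from Presto. Compared with other open-source HBase connectors, ours is 10 to 100 times faster or more.☆243Updated 2 years ago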
- Guardian of Waterdrop and Spark☆30Updated 2 years ago