Lewuathe / docker-hadoop-cluster
Multiple node cluster on Docker for self development.
☆93Updated 6 years ago
Alternatives and similar repositories for docker-hadoop-cluster
Users who are interested in docker-hadoop-cluster compare it to the repositories listed below.
- ☆70Updated 2 years ago
- Apache Spark Docker Image☆68Updated 6 years ago
- Spark Streaming HBase Example☆96Updated 9 years ago
- Example code for Kudu☆77Updated 6 years ago
- A few scripts to automate daily data loads from an RDBMS to a partitioned Avro Hive table☆29Updated 10 years ago
- Learn how to use Spark SQL and the HSpark connector package to create and query data tables that reside in HBase region servers☆69Updated 2 years ago
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- Ambari service for Presto☆44Updated 4 months ago
- Companion Code for Using Flume Book☆32Updated 9 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 3 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Updated 8 years ago
- ☆54Updated 10 years ago
- ElasticSearch integration for Apache Spark☆47Updated 9 years ago
- spark + drools☆102Updated 2 years ago
- Spark Summit 2017 San Francisco☆97Updated 7 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Code repository for the book - Mastering Flink by Tanmay Deshpande☆74Updated 8 years ago
- Visualize your HDFS cluster usage☆229Updated 4 years ago
- ☆105Updated 5 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Ambari stack for easily installing and managing Redis on HDP cluster☆14Updated 9 years ago
- Reads an HBase table and writes the output as Text, Seq, Avro, or Parquet☆28Updated 11 years ago
- hadoop-spark-hive-cluster-docker☆52Updated 7 years ago
- An Apache Flume Sink implementation to publish data to Apache Kafka☆59Updated 10 years ago
- ☆26Updated 5 years ago
- ☆58Updated 10 years ago
- Distributed SQL query engine for running interactive analytic queries against big data sources.☆44Updated 8 years ago
- Container scheduling engine based on YARN☆36Updated 9 years ago