luvres / hadoop
☆20Updated 6 years ago
Alternatives and similar repositories for hadoop:
Users that are interested in hadoop are comparing it to the libraries listed below
- Ambari stack service for easily installing and managing NTPD on HDP cluster☆14Updated 7 years ago
- Ambari service for Apache Drill☆17Updated 9 years ago
- Docker images used internally by various Teradata projects for automation, testing, etc☆40Updated 7 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Multiple node cluster on Docker for self development.☆93Updated 6 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 6 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 8 years ago
- SQL on HBase with Apache Phoenix in Docker☆29Updated 9 years ago
- Java Client of the Spark Job Server implementing the arranged Rest APIs☆50Updated 3 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- ☆18Updated 8 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 years ago
- Ambari service for Presto☆44Updated 3 months ago
- Docker Cloudera Quick Start Image☆92Updated 7 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Simple implementation of a custom parquet reader/writer☆11Updated 8 years ago
- ☆20Updated 3 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 5 years ago
- Kettle Web Integrator - An easy and open way to integrate your web app with Kettle Pentaho Data Integration☆50Updated 9 years ago
- Notes about Spark Streaming in Apache Spark☆59Updated 8 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- ansible playbook to deploy cloudera hadoop components to the cluster☆52Updated 6 years ago
- Vagrant setup for creating Ambari development/test virtual machines☆81Updated 4 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- Configuration options and instructions on how to add JanusGraph to ambari as a service☆9Updated 7 years ago
- Ambari stack for easily installing and managing Redis on HDP cluster☆15Updated 9 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆40Updated 6 months ago