codingtony / docker-impala
Docker image that runs a Hadoop cluster in single-node mode with Impala server version 2.0.1. Based on CDH5.
☆40 · Updated 9 years ago
Alternatives and similar repositories for docker-impala
Users who are interested in docker-impala are comparing it to the repositories listed below.
- Visualize your HDFS cluster usage ☆229 · Updated 4 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆240 · Updated 10 years ago
- Dockerized HDP Cluster ☆84 · Updated 7 years ago
- Support Highcharts in Apache Zeppelin ☆81 · Updated 8 years ago
- Docker image for Apache Hive running on Tez ☆25 · Updated 10 years ago
- Example project showing how to use Hive UDFs in Apache Spark ☆55 · Updated 6 years ago
- Docker image for Apache Spark ☆76 · Updated 5 years ago
- ☆70 · Updated 3 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall ☆98 · Updated 5 years ago
- The SpliceSQL Engine ☆170 · Updated 2 years ago
- A connector for SingleStore and Spark ☆162 · Updated 2 weeks ago
- Live-updating Spark UI built with Meteor ☆189 · Updated 4 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers. ☆73 · Updated 8 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 · Updated 3 years ago
- Docker Cloudera Quick Start Image ☆92 · Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Updated 3 years ago
- Document and showcase how you can create Spark Applications which run inside Docker Containers using Apache Mesos. ☆28 · Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 · Updated last year
- PostgreSQL protocol gateway for Presto distributed SQL query engine ☆292 · Updated 2 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra ☆62 · Updated 5 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase ☆50 · Updated 10 years ago
- Schema Registry integration for Apache Spark ☆40 · Updated 2 years ago
- SQL for Kafka Connectors ☆99 · Updated last year
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. ☆279 · Updated 6 years ago
- Profiler for large-scale distributed Java applications (Spark, Scalding, MapReduce, Hive, ...) on YARN. ☆128 · Updated 7 years ago
- Spark SQL index for Parquet tables ☆134 · Updated 4 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 · Updated 8 years ago
- Druid Docker ☆196 · Updated 6 years ago
- Apache Spark and Apache Kafka integration example ☆124 · Updated 7 years ago
- Scripts for parsing / making sense of yarn logs ☆52 · Updated 9 years ago