pippozq / hadoop-ansible
Install a Hadoop cluster with Ansible
☆39Updated 6 years ago
Related projects
Alternatives and complementary repositories for hadoop-ansible
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 8 months ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- phoenix☆12Updated 2 years ago
- Simple functional examples of running Hadoop + Hive in Docker with Docker Compose☆25Updated last year
- Ansible playbook to deploy Cloudera Hadoop components to the cluster☆52Updated 6 years ago
- Hadoop FSImage Analyzer (HFSA)☆57Updated last week
- Examples for how to use the Flink Docker images in a variety of ways☆91Updated 3 years ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Updated 7 years ago
- Flume JSON Interceptor Plugin☆15Updated 2 years ago
- Prometheus jmx_exporter configurations for Cloudera Hadoop☆37Updated 6 years ago
- Spark Clickhouse Connector☆72Updated 4 years ago
- Exports Hadoop HDFS content statistics to Prometheus☆152Updated last week
- Export Hadoop YARN (resource-manager) metrics in prometheus format☆50Updated 3 weeks ago
- Flink image for Kubernetes that fixes JobManager connection issue☆23Updated 6 years ago
- Flink SQL Redis connector☆12Updated 11 months ago
- Some useful user-defined functions (UDFs) for both PrestoSQL and TrinoDB☆18Updated last year
- Instructions for getting started with Ververica Platform on minikube.☆89Updated 5 months ago
- Ansible roles to install a Spark Standalone cluster (HDFS/Spark/Jupyter Notebook) or Ambari based Spark cluster☆61Updated 9 months ago
- Flink Kubernetes Toolbox is the Swiss Army knife for deploying and managing Apache Flink on Kubernetes☆54Updated 10 months ago
- Java library for managing Apache Flink via the Monitoring REST API☆56Updated 2 years ago
- A web application for submitting Spark applications☆8Updated 3 years ago
- Guardian of Waterdrop and Spark☆30Updated last year
- Facebook Presto connectors☆49Updated 3 years ago
- Kafka manager that monitors consumer-based Kafka information, including near-real-time offset/lag information.☆32Updated 5 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆21Updated 5 years ago
- Cloudera deployment automation with Ansible☆198Updated 4 years ago