timveil / docker-hadoopLinks
Simple functional examples of running Hadoop + Hive in Docker with Docker Compose
☆25Updated 3 years ago
Alternatives and similar repositories for docker-hadoop
Users that are interested in docker-hadoop are comparing it to the libraries listed below
Sorting:
- Docker image for Apache Hive Metastore☆73Updated 2 years ago
- ☆48Updated 2 years ago
- A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support loc…☆304Updated 4 months ago
- Fast desktop client for Hadoop Distributed File System☆31Updated 2 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆149Updated last year
- ☆14Updated 3 years ago
- ☆39Updated 6 years ago
- Postgresql configured to work as metastore for Hive.☆32Updated 3 years ago
- Kafka offset committer for structured streaming query☆40Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Updated 3 years ago
- Trino plugin for logging query events into a separate log file.☆40Updated 3 years ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated last year
- A library developed to ease the data ETL development process.☆134Updated 3 weeks ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 7 months ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆113Updated 9 months ago
- Install hadoop cluster with ansible☆40Updated 8 years ago
- awesome,Flink,Flink CEP☆16Updated 2 years ago
- A curated list of awesome Greenplum resources, tools☆61Updated 6 years ago
- Hadoop FSImage Analyzer (HFSA)☆66Updated last week
- Storage connector for Trino☆117Updated 2 weeks ago
- ☆20Updated 2 years ago
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆66Updated 3 weeks ago
- ORAcle database CDC (Change Data Capture)☆128Updated this week
- ☆201Updated 2 weeks ago
- A tool for developing and testing ETL and ELT processes for automating the capture, delivery and processing of information in data wareho…☆59Updated 2 years ago
- Instructions for getting started with Ververica Platform on minikube.☆95Updated 7 months ago
- Apache Flink docker image☆197Updated 3 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆41Updated last year
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆52Updated 2 weeks ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆143Updated 2 years ago