lucasmsp / docker-atlas
Cluster in docker with Apache Atlas and a minimal Hadoop ecosystem to perform some basic experiments.
☆26 Updated 8 months ago
Alternatives and similar repositories for docker-atlas:
Users who are interested in docker-atlas are comparing it to the repositories listed below.
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container. ☆141 Updated last year
- Tutorial on how to set up Trino and Apache Ranger using Docker ☆41 Updated 9 months ago
- Apache Hive Metastore as a Standalone server in Docker ☆74 Updated 8 months ago
- Repository for building docker image, with open-source applications ☆26 Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 Updated last year
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi ☆113 Updated last year
- ☆15 Updated 2 years ago
- A playground to experience Gravitino ☆44 Updated last month
- A Microsoft Power BI Custom Connector allowing you to import Trino data into Power BI. ☆68 Updated 3 months ago
- datacollector-oss ☆95 Updated 9 months ago
- ☆40 Updated 4 years ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆183 Updated 2 weeks ago
- Presto Trino with Apache Hive Postgres metastore ☆41 Updated 7 months ago
- Cluster manager for Apache Doris ☆177 Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆98 Updated 2 years ago
- ☆14 Updated 4 years ago
- This is a GitHub for all of my NiFi Templates ☆46 Updated 4 years ago
- a collection of plugins that can be used with but can't or won't be shipped with Apache Hop ☆12 Updated 3 weeks ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆121 Updated last month
- Mirror of Apache Ranger ☆15 Updated last year
- ☆79 Updated last year
- Data product portal created by Dataminded ☆184 Updated this week
- Delta Lake examples ☆224 Updated 6 months ago
- ☆265 Updated 6 months ago
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a… ☆23 Updated last year
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆37 Updated 2 months ago
- a dbt adapter for Apache Doris ☆25 Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆43 Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆77 Updated 3 weeks ago
- ☆193 Updated last week