misqe / atlas-dockerLinks
docker for apache-atlas embedded-cassandra-solr
☆23Updated 6 years ago
Alternatives and similar repositories for atlas-docker
Users that are interested in atlas-docker are comparing it to the libraries listed below
Sorting:
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 10 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 3 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- DataQuality for BigData☆147Updated 2 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆264Updated 3 years ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆143Updated 2 years ago
- Plugin for Presto to allow addition of user functions easily☆119Updated 4 years ago
- Python client for Hadoop® YARN API☆109Updated 3 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 8 years ago
- StreamLine - Streaming Analytics☆166Updated 2 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Updated 8 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆184Updated 3 years ago
- Multiple node presto cluster on docker container☆126Updated 3 years ago
- ☆103Updated 5 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 8 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Updated 4 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆58Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Updated 7 years ago
- ☆240Updated 4 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation☆23Updated 9 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆184Updated 2 years ago
- ☆63Updated 6 years ago
- Schema Registry☆17Updated last year
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Apache NiFi example flows☆210Updated 6 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Updated 8 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆84Updated 5 years ago