michalmiklas / atlas-docker

docker for apache-atlas embedded-cassandra-solr

☆23

Alternatives and similar repositories for atlas-docker

Users who are interested in atlas-docker are comparing it to the repositories listed below.


shivajid / atlas
This repository is to help with the Partner Demonstration of the Apache Atlas project.
☆30 Updated 10 years ago
agile-lab-dev / DataQuality
DataQuality for BigData
☆145 Updated 2 years ago
cloudera-labs / envelope
Build configuration-driven ETL pipelines on Apache Spark
☆162 Updated 3 years ago
qubole / presto-udfs
Plugin for Presto to allow addition of user functions easily
☆119 Updated 4 years ago
Lewuathe / docker-trino-cluster
Multiple node presto cluster on docker container
☆126 Updated 3 years ago
hortonworks-spark / spark-atlas-connector
A Spark Atlas connector to track data lineage in Apache Atlas
☆265 Updated 3 years ago
qubole / spark-acid
ACID Data Source for Apache Spark based on Hive ACID
☆97 Updated 4 years ago
hortonworks / registry
Schema Registry
☆17 Updated last year
ExpediaGroup / circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆91 Updated last year
bolcom / hive_compared_bq
hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
☆28 Updated 8 years ago
rssanders3 / airflow-spark-operator-plugin
A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
☆73 Updated 6 years ago
dataArtisans / flink-streaming-demo
☆240 Updated 4 years ago
mganta / sprue
spark + drools
☆103 Updated 3 years ago
sburn / docker-apache-atlas
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
☆143 Updated 2 years ago
miho120 / ambari-airflow-mpack
Ambari stack service for installing and managing Apache Airflow on HDP cluster
☆58 Updated 7 years ago
hortonworks-spark / spark-llap
☆103 Updated 5 years ago
FRosner / drunken-data-quality
Spark package for checking data quality
☆222 Updated 5 years ago
maropu / spark-sql-server
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
☆34 Updated 3 years ago
AbsaOSS / spline-spark-agent
Spline agent for Apache Spark
☆200 Updated 2 weeks ago
banzaicloud / spark-metrics
Spark metrics related custom classes and sinks (e.g. Prometheus)
☆184 Updated 3 years ago
abajwa-hw / ambari-workshops
Demos around Ambari Views, Services, Blueprints
☆63 Updated 9 years ago
bmc / spark-hive-udf
Example project showing how to use Hive UDFs in Apache Spark
☆55 Updated 6 years ago
gateway-experiments / hadoop-yarn-api-python-client
Python client for Hadoop® YARN API
☆109 Updated 3 years ago
yaooqinn / spark-authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…
☆181 Updated 3 years ago
bernhard-42 / spark-yarn-rest-api
Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation
☆23 Updated 9 years ago
lightcopy / parquet-index
Spark SQL index for Parquet tables
☆134 Updated 4 years ago
cartershanklin / hive-druid-ssb
Star Schema Benchmark using the Hive / Druid Integration
☆30 Updated 8 years ago
yamrcraft / etl-light
A light Kafka to HDFS/S3 ETL library based on Apache Spark
☆40 Updated 8 years ago
hortonworks / streamline
StreamLine - Streaming Analytics
☆164 Updated 2 years ago
ExpediaGroup / waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆284 Updated last month