SvenskaSpel / cobra-policytool
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
☆16Updated 4 years ago
Alternatives and similar repositories for cobra-policytool
Users who are interested in cobra-policytool are comparing it to the libraries listed below.
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 3 months ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 10 years ago
- A K8s-based infrastructure for analytics☆24Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages dat…☆16Updated 4 years ago
- Apache DataLab (incubating)☆152Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆21Updated last year
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 5 months ago
- type-class based data cleansing library for Apache Spark SQL☆78Updated 6 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated 2 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Updated 11 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 4 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- Big Data Processing Framework - Unified Data API or SQL on Any Storage☆248Updated 4 months ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆61Updated 5 years ago
- A facebook for data☆26Updated 6 years ago
- Cloud based Data Platform based on Apache Spark☆27Updated 2 months ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- ☆81Updated 2 years ago
- Sherlock is an anomaly detection service built on top of Druid☆155Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Schema Registry integration for Apache Spark☆40Updated 3 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- Apache Spark ETL Utilities☆39Updated last year