SvenskaSpel / cobra-policytoolLinks
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
☆16Updated 4 years ago
Alternatives and similar repositories for cobra-policytool
Users that are interested in cobra-policytool are comparing it to the libraries listed below
Sorting:
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Updated 4 years ago
- ☆14Updated 8 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- HDF masterclass materials☆28Updated 9 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆69Updated 4 months ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 9 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- Ambari Service definition for deploying R & RHadoop libraries☆18Updated 9 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 4 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Updated 6 years ago
- Apache Spark ETL Utilities☆40Updated 8 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Examples for High Performance Spark☆16Updated 7 months ago
- ☆27Updated last year
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Updated 6 months ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last week
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- machine learning playground☆12Updated 8 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated 4 months ago