PySpark ETL jobs with lineage reported to Apache Atlas, all in one script, via code inspection
☆17 · Jan 12, 2017 · Updated 9 years ago
Alternatives and similar repositories for pyspark-atlas
Users that are interested in pyspark-atlas are comparing it to the libraries listed below
- Ambari View for the Ambari Store ☆15 · Sep 21, 2015 · Updated 10 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster ☆18 · Nov 13, 2018 · Updated 7 years ago
- Livy and Zeppelin services for Cloudera Manager and CDH using CSDs and Parcels ☆22 · Aug 16, 2018 · Updated 7 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation ☆23 · Apr 18, 2016 · Updated 9 years ago
- golang tools for Apache Solr ☆28 · Jan 29, 2026 · Updated last month
- Apache Solr interpreter for Apache Zeppelin ☆28 · Jun 14, 2023 · Updated 2 years ago
- ☆10 · Jun 29, 2021 · Updated 4 years ago
- Ambari Service for OpenTSDB ☆34 · Dec 14, 2016 · Updated 9 years ago
- Apache Geode on Kubernetes ☆10 · Oct 19, 2019 · Updated 6 years ago
- Data Catalog for Databases and Data Warehouses ☆36 · Jan 15, 2024 · Updated 2 years ago
- ☆10 · Mar 29, 2022 · Updated 3 years ago
- Collect and aggregate on spark events for profitz ☆10 · Apr 22, 2022 · Updated 3 years ago
- ☆10 · May 28, 2025 · Updated 9 months ago
- Simple graphical interface for UniFi Gateway WAN bandwidth data usage statistics. ☆11 · Mar 27, 2022 · Updated 3 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas ☆266 · Nov 16, 2022 · Updated 3 years ago
- A DropWizard wrapper around Apache Tika. ☆10 · Dec 22, 2016 · Updated 9 years ago
- An HDFS backed ContentsManager implementation for Jupyter ☆12 · Apr 8, 2024 · Updated last year
- Various tools to help plan HDP and CDH upgrades to CDP ☆14 · Dec 7, 2021 · Updated 4 years ago
- nanomsg for iojs a.k.a NodeJS https://github.com/nanomsg/nanomsg ☆10 · Jul 20, 2015 · Updated 10 years ago
- Simple role for deploying Elixir Exrm releases. ☆10 · Jan 28, 2016 · Updated 10 years ago
- Docker data container ☆11 · Aug 23, 2015 · Updated 10 years ago
- A data analysis GUI for R ☆11 · May 5, 2025 · Updated 10 months ago
- Safe and easy way for storing and retrieving sensitive data ☆13 · Aug 23, 2022 · Updated 3 years ago
- ☆13 · Jan 4, 2023 · Updated 3 years ago
- ☆10 · Updated this week
- Data structure that maps entries to numeric ids ☆14 · Aug 16, 2015 · Updated 10 years ago
- Samples of authenticating to an Azure Key Vault vault ☆13 · May 10, 2022 · Updated 3 years ago
- FOTA for Funky ☆10 · Jul 24, 2016 · Updated 9 years ago
- Headless agent for test driven relevancy with Quepid.com ☆11 · Mar 6, 2024 · Updated 2 years ago
- REST API for controlling Google Chrome ☆13 · Sep 23, 2015 · Updated 10 years ago
- Fluorite: Apache Calcite trace analyzer ☆12 · Apr 15, 2019 · Updated 6 years ago
- Automated log shipper for Kubernetes powered by annotations ☆10 · Sep 10, 2019 · Updated 6 years ago
- Tool and library for generating X.509 certificates and certificate requests (mirror) ☆16 · Oct 13, 2021 · Updated 4 years ago
- A Pact Broker metrics exporter for Prometheus ☆10 · Sep 18, 2023 · Updated 2 years ago
- Utility to convert between positional (line and column-based) and offset (range-based) locations ☆14 · Oct 22, 2024 · Updated last year
- Zookeeper Monitoring Extension for AppDynamics ☆10 · Sep 29, 2021 · Updated 4 years ago
- Data warehouse ETL tool: incremental and full extraction from MySQL to Hive, merging of Hive tables, and other data platform cleansing utilities ☆10 · Dec 21, 2016 · Updated 9 years ago
- ☆12 · Jun 26, 2023 · Updated 2 years ago
- A collection of python utility functions ☆11 · Feb 11, 2026 · Updated 3 weeks ago