procter-gamble-oss / octopufs
The OctopuFS library helps manage cloud storage, specifically ADLS Gen2. It lets you operate on files (moving, copying, setting ACLs) very efficiently. It is designed to work on Databricks, but should work on any other platform as well.
☆11 · Updated last year
Alternatives and similar repositories for octopufs:
Users who are interested in octopufs are comparing it to the libraries listed below.
- ☆95 · Updated 2 years ago
- Databricks Platform - Architecture, Security, Automation and much more!! ☆50 · Updated last week
- Monitoring Azure Databricks jobs ☆223 · Updated 6 months ago
- ☆76 · Updated 10 months ago
- A Python package to help work with the Apache Atlas REST APIs ☆172 · Updated 5 months ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs ☆235 · Updated 2 months ago
- Tools for Deploying Databricks Solutions in Azure ☆99 · Updated last year
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender ☆19 · Updated 4 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks-based Lakehouse. ☆44 · Updated 2 months ago
- Sample code for Gen1 to Gen2 migration patterns. ☆11 · Updated 3 years ago
- PowerShell wrapper for the Databricks API ☆43 · Updated 10 months ago
- A cross-tenant, metadata-driven processing framework for Azure Data Factory and Azure Synapse Analytics achieved by coupling orchestration… ☆185 · Updated last year
- A proof of concept of how to integrate Spark Lineage in Azure Purview ☆22 · Updated 4 years ago
- A stand-alone test framework that lets you write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure… ☆98 · Updated last week
- Azure Databricks Workshops ☆92 · Updated 4 years ago
- A connector to ingest Azure Databricks lineage into Microsoft Purview ☆93 · Updated last year
- Demo of using Nutter for testing Databricks notebooks in the CI/CD pipeline ☆151 · Updated 8 months ago
- Testing framework for Databricks notebooks ☆298 · Updated 11 months ago
- Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs ☆455 · Updated last year
- End-to-end Azure Databricks Workspace automation with Azure Pipelines ☆22 · Updated last year
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset ☆108 · Updated last year
- Apache Spark Connector for Azure Cosmos DB ☆203 · Updated last month
- Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks) ☆86 · Updated 6 years ago
- Complete end-to-end sample of doing DevOps with Azure Databricks ☆69 · Updated 3 years ago
- ☆81 · Updated 2 years ago
- Extract Load Transform (ELT) framework is a metadata based batch orchestration framework for modern data platforms. Implemented using Azu… ☆32 · Updated 2 months ago
- ☆52 · Updated last year
- Solution accelerator to help build Machine Learning Lineage ☆33 · Updated 2 years ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logs ☆26 · Updated 6 months ago
- Azure Data Factory hands-on lab, self-paced. Learn how to lift & shift SSIS packages to the Cloud with ADF. Build new ETL pipelines in AD… ☆137 · Updated last year