fsspec / adlfs
fsspec-compatible Azure Data Lake and Azure Blob Storage access
☆188Updated 4 months ago
Alternatives and similar repositories for adlfs:
Users who are interested in adlfs are comparing it to the libraries listed below.
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆140Updated 2 months ago
- A data modelling layer built on top of polars and pydantic☆194Updated last year
- Coming soon☆61Updated last year
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- Read Apache Arrow batches from ODBC data sources in Python☆65Updated this week
- Jupyter Cell / Line Magics for DuckDB☆48Updated 2 months ago
- Write your dbt models using Ibis☆64Updated last month
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town.☆185Updated this week
- First-party plugins maintained by the Kedro team.☆99Updated this week
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 3 months ago
- A kedro plugin to use pandera in your kedro projects☆35Updated 6 months ago
- Turning PySpark Into a Universal DataFrame API☆385Updated this week
- Typed wrappers over pandas DataFrames with schema validation☆101Updated last year
- A declarative, 🐻‍❄️-native data frame validation library.☆70Updated this week
- A repository of runnable examples using ibis☆43Updated 9 months ago
- Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.☆534Updated this week
- Dask integration for Snowflake☆30Updated 5 months ago
- 🪴 Nebari - your open source data science platform☆290Updated last week
- pathlib api extended to use fsspec backends☆298Updated this week
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated 11 months ago
- Native Kubernetes integration for Dask☆321Updated this week
- Black for Databricks notebooks☆44Updated 3 months ago
- Make your Kedro experience snazzy☆35Updated 2 years ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆36Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆76Updated last month
- JupyterHub extension for ContainDS Dashboards☆201Updated 8 months ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆140Updated last week
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆224Updated 4 years ago