fsspec / adlfsLinks
fsspec-compatible Azure Datake and Azure Blob Storage access
☆197Updated this week
Alternatives and similar repositories for adlfs
Users that are interested in adlfs are comparing it to the libraries listed below
Sorting:
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆144Updated 2 weeks ago
- A data modelling layer built on top of polars and pydantic☆198Updated 2 years ago
- Coming soon☆61Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆115Updated 3 weeks ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.☆571Updated last week
- Black for Databricks notebooks☆47Updated 2 months ago
- A kedro plugin to use pandera in your kedro projects☆36Updated 10 months ago
- Read Apache Arrow batches from ODBC data sources in Python☆67Updated this week
- RFC document, tooling and other content related to the dataframe API standard☆109Updated last year
- Jupyter Cell / Line Magics for DuckDB☆51Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆202Updated last week
- Write your dbt models using Ibis☆70Updated 5 months ago
- SQLAlchemy driver for DuckDB☆450Updated this week
- Typed wrappers over pandas DataFrames with schema validation☆102Updated last year
- Native Kubernetes integration for Dask☆323Updated last month
- VSCode extension to work with Databricks☆132Updated last week
- First-party plugins maintained by the Kedro team.☆104Updated this week
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆226Updated 5 years ago
- Dask integration for Snowflake☆30Updated 3 weeks ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆136Updated this week
- Pandas ExtensionDType/Array backed by Apache Arrow☆231Updated 2 years ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆39Updated this week
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- Turning PySpark Into a Universal DataFrame API☆422Updated this week
- JupyterHub extension for ContainDS Dashboards☆201Updated last year
- A Python package that parses SQL and interprets it as methods that act upon existing pandas (or other types of) DataFrames that have been…☆98Updated 3 years ago
- Distributed SQL Engine in Python using Dask☆407Updated 11 months ago
- High-level wrapper around BCP for high performance data transfers between pandas and SQL Server. No knowledge of BCP required!!☆133Updated last week
- Polars plugin for stable hashing functionality☆78Updated 3 months ago