fsspec / adlfs
fsspec-compatible Azure Datake and Azure Blob Storage access
☆175Updated last month
Related projects: ⓘ
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆130Updated this week
- A data modelling layer built on top of polars and pydantic☆197Updated last year
- Black for Databricks notebooks☆44Updated last month
- Read Apache Arrow batches from ODBC data sources in Python☆54Updated 2 weeks ago
- A kedro plugin to use pandera in your kedro projects☆33Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- SQLAlchemy driver for DuckDB☆336Updated this week
- Databricks SQL Connector for Python☆153Updated this week
- RFC document, tooling and other content related to the dataframe API standard☆99Updated 5 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 5 months ago
- API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins…☆306Updated last month
- Coming soon☆57Updated 10 months ago
- First-party plugins maintained by the Kedro team.☆91Updated this week
- A data modelling layer built on top of polars and pydantic☆272Updated 2 weeks ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆36Updated 3 weeks ago
- Native Kubernetes integration for Dask☆312Updated this week
- Extremely lightweight compatibility layer between pandas and Polars☆38Updated 4 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆81Updated this week
- Distributed SQL Engine in Python using Dask☆385Updated 3 weeks ago
- pathlib api extended to use fsspec backends☆238Updated this week
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆127Updated last week
- ☆89Updated last week
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆223Updated 4 years ago
- High-level wrapper around BCP for high performance data transfers between pandas and SQL Server. No knowledge of BCP required!!☆125Updated this week
- A Python package that parses SQL and interprets it as methods that act upon existing pandas (or other types of) DataFrames that have been…☆98Updated 3 years ago
- A consistent table management library in python☆161Updated last year
- Turning PySpark Into a Universal DataFrame API☆279Updated this week
- Docker images for dask☆231Updated this week
- JupyterLab extension for Dask☆311Updated last year
- VSCode extension to work with Databricks☆121Updated last week