fsspec / s3fs
S3 Filesystem
☆911 · Updated this week
Alternatives and similar repositories for s3fs:
Users who are interested in s3fs compare it to the libraries listed below.
- Python implementation of the parquet columnar file format. ☆808 · Updated 3 months ago
- PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena. ☆469 · Updated last month
- Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage. ☆504 · Updated this week
- A specification that Python filesystems should adhere to. ☆1,106 · Updated this week
- Pythonic file-system interface for Google Cloud Storage ☆354 · Updated 2 weeks ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,027 · Updated last month
- Wrapper to use boto3 resources with the aiobotocore async backend ☆796 · Updated last month
- Native Kubernetes integration for Dask ☆318 · Updated 3 weeks ago
- Extended pickling support for Python objects ☆1,703 · Updated last month
- asyncio support for botocore library using aiohttp ☆1,239 · Updated this week
- Fast Avro for Python ☆654 · Updated 3 weeks ago
- s3path is a pathlib extension for AWS S3 Service ☆213 · Updated 3 months ago
- A distributed task scheduler for Dask ☆1,602 · Updated this week
- Robust and reusable Executor for joblib ☆548 · Updated 3 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...) ☆3,266 · Updated 2 months ago
- Thin-wrapper around the mock package for easier use with pytest ☆1,893 · Updated last week
- Iterative JSON parser with Pythonic interfaces ☆890 · Updated 3 weeks ago
- Docker images for dask ☆234 · Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆627 · Updated this week
- Python library providing function decorators for configurable backoff and retry ☆2,634 · Updated 9 months ago
- pytest fixture for benchmarking code ☆1,278 · Updated 3 months ago
- Development tool to measure, monitor and analyze the memory behavior of Python objects in a running Python application. ☆1,236 · Updated 7 months ago
- Simplified packaging of Python modules ☆2,192 · Updated this week
- serialize all of Python ☆2,318 · Updated this week
- JupyterLab extension for Dask ☆316 · Updated 2 weeks ago
- High level asynchronous concurrency and networking framework that works on top of either trio or asyncio ☆1,918 · Updated this week
- SQLAlchemy dialect for BigQuery ☆443 · Updated 3 weeks ago
- Lightweight lockfile for conda environments ☆505 · Updated last week
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle) ☆449 · Updated last month
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 · Updated 4 years ago