dask / fastparquetLinks
python implementation of the parquet columnar file format.
☆834Updated 2 months ago
Alternatives and similar repositories for fastparquet
Users that are interested in fastparquet are comparing it to the libraries listed below
Sorting:
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆637Updated this week
- python implementation of the parquet columnar file format.☆353Updated 3 years ago
- Scalable Machine Learning with Dask☆938Updated last month
- S3 Filesystem☆945Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,049Updated last month
- PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena.☆477Updated 3 weeks ago
- Distributed SQL Engine in Python using Dask☆405Updated 9 months ago
- A distributed task scheduler for Dask☆1,631Updated this week
- Extended pickling support for Python objects☆1,778Updated 2 months ago
- Describing statistical models in Python using symbolic formulas☆968Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow☆230Updated 2 years ago
- Real-time stream processing for python☆1,267Updated 7 months ago
- A Python package to manage extremely large amounts of data☆1,334Updated this week
- Easy pipelines for pandas DataFrames.