dask / fastparquet
python implementation of the parquet columnar file format.
☆833 Updated 2 months ago
Alternatives and similar repositories for fastparquet
Users who are interested in fastparquet compare it to the libraries listed below.
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆636 Updated last month
- python implementation of the parquet columnar file format. ☆350 Updated 3 years ago
- Extended pickling support for Python objects ☆1,768 Updated 2 months ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,048 Updated 2 weeks ago
- Scalable Machine Learning with Dask ☆938 Updated 3 weeks ago
- Distributed SQL Engine in Python using Dask ☆405 Updated 9 months ago
- PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena. ☆475 Updated last week
- A distributed task scheduler for Dask ☆1,629 Updated this week
- S3 Filesystem ☆940 Updated last week
- Real-time stream processing for python ☆1,265 Updated 6 months ago
- Fast Avro for Python ☆672 Updated 2 weeks ago
- A specification that python filesystems should adhere to. ☆1,169 Updated last week
- Docker images for dask ☆241 Updated last week
- 🚎 Notebook sharing hub ☆500 Updated last year
- Immutable and statically-typeable DataFrames with runtime type and data validation ☆461 Updated last week
- A Python package to manage extremely large amounts of data ☆1,333 Updated last month
- Data Migration for the Blaze Project ☆1,002 Updated 2 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆230 Updated 2 years ago
- Describing statistical models in Python using symbolic formulas ☆969 Updated this week
- Jupyter magics and kernels for working with remote Spark clusters ☆1,355 Updated this week
- JupyterLab extension for Dask ☆322 Updated 3 months ago
- Robust and reusable Executor for joblib ☆566 Updated last week
- sqldf for pandas ☆1,348 Updated 10 months ago
- Fast NumPy array functions written in C ☆1,117 Updated 2 weeks ago
- Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more ☆2,319 Updated last month
- Pythonic file-system interface for Google Cloud Storage ☆364 Updated last week
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting ☆919 Updated last week
- Dask tutorial ☆1,853 Updated last year
- A library for defensive data analysis. ☆500 Updated 5 years ago
- Easy pipelines for pandas DataFrames. ☆720 Updated 7 months ago