JDASoftwareGroup / kartothek
A consistent table management library in python
☆159Updated last year
Alternatives and similar repositories for kartothek:
Users that are interested in kartothek are comparing it to the libraries listed below
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated 2 years ago
- A factory for simplekv-Store-based storage classes.☆24Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆140Updated 3 months ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆140Updated 3 weeks ago
- Native Kubernetes integration for Dask☆321Updated last week
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆224Updated 4 years ago
- Useful Mutable Mappings☆70Updated last year
- SQLAlchemy dialect for Turbodbc☆23Updated 3 weeks ago
- Caching based on computation time and storage space☆136Updated 4 years ago
- Concurrent appendable key-value storage☆106Updated 9 months ago
- Function dependencies resolution and execution☆70Updated 4 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆635Updated this week
- A minimal key-value store interface for binary data (maintained fork of simplekv).☆17Updated this week
- Run-length encoded arrays for pandas.☆21Updated last year
- Docker images for dask☆240Updated this week
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- JupyterLab extension for Dask☆322Updated 2 months ago
- ☆76Updated 8 months ago
- Marshmallow Schema generator for Pandas DataFrames☆24Updated 4 years ago
- Immutable and statically-typeable DataFrames with runtime type and data validation☆457Updated this week
- A filesystem-like contents manager for multiple backends in Jupyter☆218Updated this week
- kubernetes setup to bootstrap distributed on google container engine☆67Updated 5 years ago
- SQLAlchemy dialect for EXASOL☆35Updated this week
- A validation library for Pandas data frames using user-friendly schemas☆191Updated 2 years ago
- Deploy dask on YARN clusters☆69Updated 8 months ago
- A web frontend for scheduling Jupyter notebook reports☆252Updated 5 months ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated last week
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Coming soon☆61Updated last year
- ⚡️ An efficient cache for the execution of dask graphs.☆71Updated last year