JDASoftwareGroup / kartothekLinks
A consistent table management library in python
☆160Updated 2 years ago
Alternatives and similar repositories for kartothek
Users that are interested in kartothek are comparing it to the libraries listed below
Sorting:
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆143Updated this week
- A factory for simplekv-Store-based storage classes.☆24Updated last year
- Native Kubernetes integration for Dask☆324Updated last week
- Caching based on computation time and storage space☆138Updated 4 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆145Updated 2 months ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆649Updated last week
- Useful Mutable Mappings☆72Updated 2 years ago
- nbconflux converts Jupyter Notebooks to Atlassian Confluence pages☆127Updated last year
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆226Updated 5 years ago
- Deploy dask on YARN clusters☆69Updated last year
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Updated 7 years ago
- Concurrent appendable key-value storage☆107Updated last year
- Function dependencies resolution and execution☆71Updated 5 years ago
- JupyterLab extension for Dask☆326Updated 7 months ago
- A validation library for Pandas data frames using user-friendly schemas☆193Updated 2 years ago
- ipywidgets library for drawing directed acyclic graphs in jupyterlab using dagre-d3☆85Updated 3 weeks ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Immutable and statically-typeable DataFrames with runtime type and data validation☆477Updated this week
- SQLAlchemy dialect for EXASOL☆36Updated this week
- A web frontend for scheduling Jupyter notebook reports☆254Updated last year
- Docker images for dask☆244Updated 3 weeks ago
- Coming soon☆62Updated 2 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere.☆83Updated 11 months ago
- Summarise and explore Pandas DataFrames☆99Updated 5 years ago
- Start a cluster in EC2 for dask.distributed☆105Updated 5 years ago
- kubernetes setup to bootstrap distributed on google container engine☆66Updated 6 years ago
- ☆76Updated last year
- RFC document, tooling and other content related to the dataframe API standard☆107Updated last year
- Convert pyproject.toml to environment.yaml☆132Updated 2 years ago