A consistent table management library in python
☆160May 15, 2023Updated 2 years ago
Alternatives and similar repositories for kartothek
Users that are interested in kartothek are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A factory for simplekv-Store-based storage classes.☆24Jan 13, 2024Updated 2 years ago
- A minimal key-value store interface for binary data (maintained fork of simplekv).☆17Mar 16, 2026Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Feb 22, 2023Updated 3 years ago
- Flat files, flat land.☆26Updated this week
- Run-length encoded arrays for pandas.☆22May 16, 2023Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆655Mar 1, 2026Updated 3 weeks ago
- A semaphore service for distributed systems☆39Mar 18, 2026Updated last week
- Postgraas is a super simple PostgreSQL-as-a-service☆29Apr 6, 2020Updated 5 years ago
- A simple key-value store for binary data.☆159Oct 27, 2023Updated 2 years ago
- SQL on dataframes - pandas and dask☆64Apr 25, 2018Updated 7 years ago
- implementation of Cyclic Boosting machine learning algorithms☆95Sep 2, 2024Updated last year
- SQLAlchemy dialect for EXASOL☆36Mar 17, 2026Updated last week
- A multi-tenant server for securely deploying and managing Dask clusters.☆143Mar 2, 2026Updated 3 weeks ago
- parquet dedupe estimator☆25Feb 20, 2026Updated last month
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,071Updated this week
- Subsumed into xnd☆25Aug 30, 2023Updated 2 years ago
- ☆17Dec 7, 2022Updated 3 years ago
- ArrayViews: creating specific views to array storage objects☆16Feb 6, 2019Updated 7 years ago
- IPython magic for parallel profiling (like `%time`, but parallel)☆72Jul 17, 2017Updated 8 years ago
- A command line tool to query an ODBC data source and write the result into a parquet file.☆251Updated this week
- T4 is now in production as Quilt 3☆64Jun 4, 2019Updated 6 years ago
- Prediction of traffic patterns in bike sharing systems. Including dashboard for clustering analysis of stations in bike share networks ba…☆11Sep 22, 2022Updated 3 years ago
- An optimal space run-length Burrows-Wheeler transform full-text index☆27Oct 28, 2021Updated 4 years ago
- Random Forests for Change Point Detection☆60Dec 12, 2025Updated 3 months ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…☆10Oct 23, 2019Updated 6 years ago
- High performance, editable, stylable datagrids in jupyter and jupyterlab☆113Updated this week
- A graph-inspired data structure for determining likely chains of sequences from breadcrumbs of evidence☆17Jun 29, 2021Updated 4 years ago
- Cyber threat intelligence crates for Rust☆16Jan 22, 2024Updated 2 years ago
- General purpose, language-agnostic Continuous Benchmarking (CB) framework☆35Apr 15, 2020Updated 5 years ago
- A query and aggregation framework for Bcolz (W2013-01)☆56Jul 9, 2024Updated last year
- IP Address dtype and block for pandas☆106Jul 31, 2023Updated 2 years ago
- ⚡️ An efficient cache for the execution of dask graphs.☆71Nov 1, 2023Updated 2 years ago
- interactive code annotations for JupyterLab and IPython☆18Mar 13, 2019Updated 7 years ago
- Singular Genomics Demultiplexing Tool☆16Mar 5, 2024Updated 2 years ago
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Nov 8, 2018Updated 7 years ago
- Data loader for the Apache Arrow format.☆63Mar 1, 2026Updated 3 weeks ago
- An opinionated open source deployment of jupyterhub based on an Slurm job scheduler.☆30Sep 30, 2024Updated last year
- Python package for dynamic system estimation of time series☆40Oct 4, 2020Updated 5 years ago
- Bloom + C++☆17Sep 12, 2017Updated 8 years ago