dylan-profiler / compressio
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same data.
☆28 · Updated 2 years ago
Alternatives and similar repositories for compressio:
Users who are interested in compressio are comparing it to the libraries listed below.
- An abstraction layer for parameter tuning ☆36 · Updated 4 months ago
- Type System for Data Analysis in Python ☆210 · Updated 5 months ago
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- Automated Jupyter notebook testing. 📙 ☆41 · Updated 11 months ago
- Tools for making Prefect work better for typical data science workflows ☆19 · Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆78 · Updated 4 months ago
- Function dependencies resolution and execution ☆71 · Updated 4 years ago
- dagster scikit-learn pipeline example. ☆44 · Updated last year
- captures logs and makes cron more fun ☆72 · Updated 4 months ago
- pandas data creation by data classes ☆49 · Updated 2 weeks ago
- Pipeline components that support partial_fit. ☆44 · Updated 6 months ago
- The easiest way to integrate Kedro and Great Expectations ☆53 · Updated 2 years ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas. ☆59 · Updated 2 weeks ago
- ipywidgets library for drawing directed acyclic graphs in jupyterlab using dagre-d3 ☆79 · Updated last month
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 · Updated 3 years ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema ☆28 · Updated 2 years ago
- SciKIt-learn Pipeline in PAndas ☆42 · Updated last year
- A collection of python utility functions ☆12 · Updated 6 months ago
- Comparing Polars to Pandas and a small introduction ☆43 · Updated 3 years ago
- ☆40 · Updated 7 months ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools. ☆27 · Updated 2 years ago
- Using ag-Grid in Jupyter notebooks. ☆57 · Updated 7 months ago
- Summarise and explore Pandas DataFrames ☆99 · Updated 4 years ago
- Coming soon ☆59 · Updated last year
- Repository to maintain infrastructure to automate Data Workflows ☆34 · Updated 3 years ago
- A small python library that can clump lists of data together. ☆147 · Updated 3 years ago
- Cluster tools for running Dask on Databricks ☆13 · Updated 7 months ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆21 · Updated 2 years ago