felipeam86 / cachesql
Fast, resilient and reproducible data analysis with cached SQL queries
β30Updated last year
Alternatives and similar repositories for cachesql:
Users that are interested in cachesql are comparing it to the libraries listed below
- β30Updated last year
- πΎ PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.β15Updated last year
- Automated Jupyter notebook testing. πβ41Updated 11 months ago
- Comparing Polars to Pandas and a small introductionβ43Updated 3 years ago
- File processing pipelinesβ86Updated 2 years ago
- β21Updated 4 months ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schemaβ28Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAMβ83Updated last year
- SciKIt-learn Pipeline in PAndasβ42Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.β75Updated 11 months ago
- dagster scikit-learn pipeline example.β44Updated last year
- Set-oriented Operations in Pandasβ24Updated 4 years ago
- Cluster tools for running Dask on Databricksβ13Updated 7 months ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn frβ¦β55Updated 3 years ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas.β59Updated 2 weeks ago
- Decorators that logs stats.β107Updated last year
- WhyProfiler is a CPU profiler for Jupyter notebook that not only identifies hotspots but can suggest faster alternatives.β44Updated 2 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.β18Updated 6 years ago
- Marshmallow Schema generator for Pandas DataFramesβ24Updated 4 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.β65Updated 3 years ago
- Tools for making Prefect work better for typical data science workflowsβ19Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Explorationβ34Updated 4 years ago
- captures logs and makes cron more funβ72Updated 4 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β26Updated last month
- A quicker pickleβ108Updated 2 years ago
- Repository to maintain infrastructure to automate Data Workflowsβ34Updated 3 years ago
- A mini dashboard to help find slow tests in pytest.β79Updated 6 months ago
- Public repository for versioning machine learning dataβ42Updated 3 years ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & trackingβ55Updated 3 years ago
- Open source bits of athenian-api.β19Updated last year