felipeam86 / cachesql
Fast, resilient and reproducible data analysis with cached SQL queries
β30Updated last year
Alternatives and similar repositories for cachesql:
Users that are interested in cachesql are comparing it to the libraries listed below
- πΎ PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.β15Updated last year
- Decorators that logs stats.β109Updated 2 weeks ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schemaβ28Updated 3 years ago
- Automated Jupyter notebook testing. πβ41Updated last year
- β29Updated last year
- Building an API with the FastAPI framework to serve a scikit-learn model.β18Updated 6 years ago
- captures logs and makes cron more funβ75Updated 6 months ago
- Comparing Polars to Pandas and a small introductionβ43Updated 3 years ago
- A small python library that can clump lists of data together.β149Updated 3 years ago
- Automated Exploratory Data Analysis. Simplifying Data Explorationβ34Updated 4 years ago
- Declarative layer for your database.β37Updated 2 years ago
- WhyProfiler is a CPU profiler for Jupyter notebook that not only identifies hotspots but can suggest faster alternatives.β44Updated 3 years ago
- Create animated and pretty Pandas Dataframeβ117Updated last year
- Python Data Collection Libraryβ45Updated 3 years ago
- Set-oriented Operations in Pandasβ24Updated 4 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn frβ¦β57Updated 3 years ago
- File processing pipelinesβ86Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.β75Updated last year
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browserβ33Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelinesβ53Updated 6 months ago
- β21Updated 7 months ago
- Python stream processing for humansβ185Updated last month
- A small Python library for one-sided tolerance bounds and two-sided tolerance intervals.β16Updated last year
- manipulate pandas dataframes from the comfort of your browserβ171Updated 3 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β29Updated 3 months ago
- A collection of python utility functionsβ11Updated 8 months ago
- Open source bits of athenian-api.β19Updated last year
- SciKIt-learn Pipeline in PAndasβ42Updated last year
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.β65Updated 3 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your dataβ82Updated this week