felipeam86 / cachesql
Fast, resilient and reproducible data analysis with cached SQL queries
☆30 · Updated last year
Alternatives and similar repositories for cachesql:
Users who are interested in cachesql are comparing it to the libraries listed below.
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 · Updated last year
- Automated Jupyter notebook testing. 📙 ☆41 · Updated last year
- Decorators that log stats. ☆111 · Updated last month
- captures logs and makes cron more fun ☆76 · Updated 7 months ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema ☆28 · Updated 3 years ago
- ☆29 · Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 · Updated last year
- File processing pipelines ☆86 · Updated 3 years ago
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- SciKIt-learn Pipeline in PAndas ☆42 · Updated last year
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 · Updated 3 years ago
- ☆21 · Updated 8 months ago
- AsyncIO serving for data science models ☆24 · Updated 2 years ago
- Comparing Polars to Pandas and a small introduction ☆43 · Updated 3 years ago
- Fuzzy joins for python pandas - easily join different datasets ☆59 · Updated 4 years ago
- Marshmallow Schema generator for Pandas DataFrames ☆24 · Updated 4 years ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas. ☆60 · Updated 4 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 · Updated last year
- WhyProfiler is a CPU profiler for Jupyter notebook that not only identifies hotspots but can suggest faster alternatives. ☆44 · Updated 3 years ago
- A small python library that can clump lists of data together. ☆149 · Updated 3 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 · Updated 3 years ago
- A small Python library for one-sided tolerance bounds and two-sided tolerance intervals. ☆16 · Updated 2 years ago
- An abstraction layer for parameter tuning ☆35 · Updated 8 months ago
- Python Data Collection Library ☆45 · Updated 3 years ago
- Tools for making Prefect work better for typical data science workflows ☆18 · Updated 3 years ago
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆69 · Updated last year
- 🎛 Distributed machine learning made simple. ☆49 · Updated 2 years ago
- pandas data creation by data classes ☆51 · Updated 4 months ago
- A simple script to help schedule Jupyter Notebook execution and store the results using Papermill ☆27 · Updated 5 years ago
- 🍦 Deployment tool for online machine learning models ☆97 · Updated 2 years ago