felipeam86 / cachesql
Fast, resilient and reproducible data analysis with cached SQL queries
☆30 · Updated 2 years ago
Alternatives and similar repositories for cachesql
Users that are interested in cachesql are comparing it to the libraries listed below
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆86 · Updated last year
- Automated Jupyter notebook testing. ☆41 · Updated last year
- Decorators that log stats. ☆115 · Updated 9 months ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema ☆28 · Updated 3 years ago
- A small python library that can clump lists of data together. ☆148 · Updated 4 years ago
- captures logs and makes cron more fun ☆79 · Updated last year
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ☆216 · Updated 4 years ago
- ☆31 · Updated 2 years ago
- A plugin for Flake8 that checks pandas code ☆170 · Updated 2 years ago
- PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 · Updated 2 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information. ☆156 · Updated 2 months ago
- A small Python library for one-sided tolerance bounds and two-sided tolerance intervals. ☆17 · Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆84 · Updated 3 months ago
- Fuzzy joins for python pandas - easily join different datasets ☆59 · Updated 5 years ago
- The easiest way to integrate Kedro and Great Expectations ☆54 · Updated 2 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst… ☆150 · Updated last year
- ☆23 · Updated last year
- manipulate pandas dataframes from the comfort of your browser ☆174 · Updated 4 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆226 · Updated 5 years ago
- simple, flexible, offline capable, cloud storage with a Python path-like interface ☆174 · Updated 7 months ago
- Databay is a Python interface for scheduled data transfer. It facilitates transfer of (any) data from A to B, on a scheduled interval. ☆188 · Updated 2 years ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas. ☆63 · Updated 11 months ago
- SciKIt-learn Pipeline in PAndas ☆42 · Updated 2 years ago
- Comparing Polars to Pandas and a small introduction ☆44 · Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 · Updated last month
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆36 · Updated 3 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆30 · Updated 3 years ago
- SQL interface to Pandas ☆52 · Updated 4 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 · Updated 4 years ago
- Marshmallow Schema generator for Pandas DataFrames ☆24 · Updated 5 years ago