MuttData / muttlibLinks
β31Updated last year
Alternatives and similar repositories for muttlib
Users that are interested in muttlib are comparing it to the libraries listed below
Sorting:
- Automated Jupyter notebook testing. πβ41Updated last year
- Comparing Polars to Pandas and a small introductionβ44Updated 4 years ago
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ79Updated last year
- Feature engineering library that helps you keep track of feature dependencies, documentation and schemaβ28Updated 3 years ago
- Set-oriented Operations in Pandasβ24Updated 5 years ago
- Fast, resilient and reproducible data analysis with cached SQL queriesβ30Updated 2 years ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas.β63Updated 11 months ago
- πΎ PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.β15Updated 2 years ago
- SQL interface to Pandasβ52Updated 3 years ago
- Sample projects using Ploomber.β86Updated last year
- The easiest way to integrate Kedro and Great Expectationsβ54Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAMβ86Updated last year
- Build and deploy a serverless data pipeline on AWS with no effort.β111Updated 2 years ago
- βοΈ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.β45Updated 8 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β114Updated 3 weeks ago
- Python Data Collection Libraryβ45Updated 4 years ago
- kedro plugin to automatically construct pipelines using pytest style pattern matchingβ22Updated 3 months ago
- Decorators that logs stats.β115Updated 8 months ago
- A template for an AWS Lambda function that triggers Prefect Flow Runsβ20Updated 4 years ago
- SciKIt-learn Pipeline in PAndasβ42Updated 2 years ago
- A small python library that can clump lists of data together.β149Updated 4 years ago
- Distributed persistent Task Queue running on Daskβ38Updated 2 years ago
- Dvc + Streamlit = β€οΈβ40Updated 2 years ago
- PyNLP Lib is an open source Python NLP library that provides functionality for both web and local developmentβ50Updated 3 years ago
- Material for Talk Python Training course on Getting Started with Dask.β30Updated 2 years ago
- Create animated and pretty Pandas Dataframeβ119Updated 2 years ago
- Cloud-agnostic Python APIβ60Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need.β105Updated last week
- Anovos - An Open Source Library for Scalable feature engineering Using Apache-Sparkβ74Updated 2 years ago
- A utility for labeling clusters of text data.β28Updated 4 years ago